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@ Hybrid polypeptide containing an avidin binding polypeptide. 



@ A hybrid polypeptide is disclosed comprising 
an avidin-bindtng polypeptide containing a 
biotin binding domain fused to a polypeptide of 
interest wherein the avidrn-binding polypeptide 
is "upstream" of the polypeptide of Interest 
The hybrid polypeptide is produced by reconv 
binant DNA techniques. The hybrid polypeptide 
may also contain a deavage site for deaving the 
polypeptide of interest from the avidin-binding 
polypeptide by using an ai^ropriate proteolytic 
or chemical reagent The hybrid polypeptide is 
expressed in appropriate host cells transformed 
with the DNA expression vector encoding the 
hybrid polypeptide, and may be recovered from 
crude cell extracts In high yield and high purity 
using avidin affinity chromatography. Following 
aNndin affinity purification, the polypeptide tor 
adachment and polypeptide of interest may be 
deaved to yield polypeptide of interest in a 
highly pure and highly active state. 
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The present invention relates to a hybrid pd^eptide. 

In particular, the present invention relates to a recombinant hybrid polypeptide comprising a polypeptide 
of interest fused to an avidin-binding polypeptide. The avid in-binding polypeptide contains a biotin attachment 
domain. 

5 More particularly, the present invention relates to a hybrid polypeptide comprising a pdypeptide of interest 

fused to a biotinylated polypeptide that can bind to avidin. 

The present invention also r^ates to a nucleic acid sequence that encodes for the hybrid polypeptide, a 
process for producing the same and a process for recovering same. 
In particular, the nucleic acid sequence is a DNA expression vector. 
10 Generally, the synthesis of commercially important peptides and proteins has been limited by high produc- 
tion and purification costs and, also, poor product recovery. Unfa! recently, animals, micro-organisms, plants, 
cadavers, serum, and urine have been the only sources from which bioactive polypeptides could be purified. 

However, advances in recombinant DNA technology have made the biological synthesis of valuable poly- 
peptides possible and in commercial quantities. In this regard, recombinant DNA m<^ecules directing the syn- 
15 thesis of conrunercially useful polypeptides can be introduced into procaryotic or eucaryotic expression syst^s. 
For example, recombinant DNA technology has enabled human growth hormone production by recombinant 
bacteria and, today, fenmentation replaces the traditional source. 

Todate, biological synthesis is the only practical approach to the commercial-scale synthesis of peptides 
of greater ttian 20 amino acid residues. Once synthesized, the desired polypeptide product must be purified 
20 from a complex mixture of cellular components. The degree of purification depends upon the intended appli- 
cation of the polypeptide. The cost of purification can account for up to 70% of the cost of producton, as sub- 
stantial losses of active ingredient usually occur during multistep purification processes. 

Polypeptide purifications are usually achieved through one or more processes which are based upon phys- 
ical properties of the polypeptide of interest For example, proteins may be separated on the basis of solubility. 
25 size, ionic properties or affinity for specific ligands; usually several of these techniques are required to achieve 
acceptable purity. 

Affinity resin chroniatography can greatly reduce the number of purification steps required to achieve the 
desired level of purity. Affinity purification is based upon a specific binding interaction between a polypeptide 
to be purified and a ligand which is usually attached to a solid support As used herein, the polypeptide binds 

30 to the ligand by ^rtue of a prosthetic group tK)und to an attachment domain present on the polypeptide. When 
a complex mixture such as a cell extract or crude mixture of synthetic peptides is passed over an affinity resin, 
the polypeptide to be purified is sele<^ely retained by the resin and all molecules lacking the prosthetic group 
on the attachment domain are washed away from the resin. 

Therefore, in a single step, the polypeptide of interest may be recovered in high purity. 

35 In order to use affinity chrontatography to advantage for polypeptide purification, recombinant DNA tedv 
nology can be used to construct chimeric gene fusions for recombinant hybrid polypeptides which in bacterial 
host cells incorporate the following elements: a 5'promoter; DNA coding for a polypeptide of interest; DNA cod- 
ing for a polypeptide that contains a ligand binding domain; and optionally ribosomal terminat(HS, such as the 
nnB terminators found on the E. coli expression vector pkk223-3 (Brosius, j. and Holy A., Proc Nat Acad Sci 

40 USA 81:6929-6933 (1 984); Brosius. j. et al Rasmid 6:112-1 18 (1984)). 

Suitable promoters are those which maximize expression of the desired gene in the host cell, and factors 
to be considered in promoter construction are discussed by Old and Primrose in Chapter 7 of Principles of Gene 
Manipulation 3rd Edition (Kadcwell Scientific Publications, Palo Alto CA 1985). Examples of t)acterial promot- 
ers apfx'opriate for expression of doned genes include the PL, tac, lac, and trp pron[x>ters (ibid). 

45 The DNA used to construct chimeric gene fusions can be obtained firom organisms or can be novel synthetic 
DNA fragments, or combinations thereof. The DNA sequences are assembled into a chimeric gene, which is 
inserted into a DNA expression vector in such a manner that in the appropriate host organism, the polypeptide 
of interest and the polypeptide for attachment to the affinity resin are produced as a single polypeptide chain. 
Other systems for affinity purification of hybrid recombinant polypeptides are known. However, significant 

so technical obstacles limit their use for commercial-scale polypeptide purification. For examine, chimeric genes 
encoding pdypeptkles containing a polyarginine C-terminal tail (Sassenfeld and Brewer, Biotechnology 2:76- 
81 (1984)) or polyhtstidine domain (Smith et al. J. Bid Chem 263:7211 (1988)) can facDitate separation by ion 
exchange or metal chelate ion chromatography. Such systems are not broadly applicable because affinify in- 
teraction depends upon physical properties of the fusion polypeptide (changeability to chelate metals), and it 

55 is not always possible to achieve sufficient change in these physical properties to pennnit affinity binding. 

Anothertype of affinity chromatography is tmnrujnoaffinity chromatography, wherein polypeptides of interest 
are fused to immunogenic proteins such as E. coli beta-galactosidase (Ruther and Muiler-Hill, EMBO J 2:1791- 
1794 (1983)) or small hydraphilic peptides (Hopp etal., 1184,703,004 (1988)) to achieve purification. Polypep- 
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tides fused to staphylococcal protein A can be purified using IgG-Sepharose (Nilsson et al. EMBO J. 4:1075- 
1080 (1985), Lowenadler et al. EMBO J. 5:2393-2398 (1986)). Polypeptides fused to Protein G can be isolated 
using albumin as the immobilized llgand (Nygren et al. J. Mol. Recognition, 1:69 (1988)). 

A critical disadvantage limiting the usefulness of these prior art methods Is that extreme conditions, indud- 
5 ing the use of denaturants, are necessary to remove the fusbn proteins from the affinity resin, vtrtiich may de- 
stroy bicriogical activity if native folding cannot be achieved. Low product recovery rates can also limit the use- 
fulness of such systenns. 

Affinity based upon the binding of small molecules by a large protein is known as substrate-affinity chro- 
matography. A small nDolecule, a ligand, forms a complex with a specific ligate. Examines of ligand:ligate conv 

10 binations include avidin:biot*n (Green in Advances in Protein Chemistry Vol 29 pp 85-133 Anson et al., 
Eds.(1975)), streptavidin:biotin (PCT/US85/01901, Meade and Garvin (1985)), lipoic acid:avidin (Green ibid), 
chloramphenicol acetyl transferase:acetyl CoA(EPO0131383, Bennetetal. (1984)), beta-galactosidase: para- 
aminophenyl-beta-[>-thio-galactoside (Offensberger et al. Proc. Natl Acad. Scl USA 82:7540-7544 (1985)), 
phosphate binding protein: hydroxys patite (Anba et al. Gene 53:219 (1987)), maltose binding protein:starch 

15 (EPO 286239, Guan et al. 1988), and glutathione S-transfierase:glutathione (Smith and Johnson Gene 67:21- 
30 (1988)). 

In recent years, the unique properties of the prosthetic group biotin and its exceptional)y high affinity (1015 
M-1) and speciflcrtyforthe proteins avrdin and streptavidin (Green ibid.) have been exploited to devise powerful 
and widely applicable tools for microbiology, biochemistry and medical science (Wilchek and Bayer Analyt Bio- 
20 chem 171:1-32 (1988), Bayer and Wilchek Methods in Biochem Anal 26:1-45 (1980)). 

Bk)tin is a prosthetic group found on only a few protein species (Ann N.Y>Acad. Sci 447:1-441, Dakshina- 
murti and Bhagavan, Eds. (1985)). Attachment in vivo is mediated by bk>tin holoenzyme synthetases which 
recognizes a highly conserved attachment domain and catalyzes the covalent attachment^of biotin to that do^' 
main (Wood et al. J Biol Chem 225:7397-7409 (1980); Shenoy and Wood, FASCB S 2:2396-2401 (1988)). 
25 Experiments using recombinant DNA technology have shown that biotin holoenzyme synthetases will bio- 

tinylate heterologous polypeptkles containing this conserved attachment domain. For example, the 1 .3S sub- 
unit of the enzyme transcarboxylase from Propionibacterium .^which contains the conserved sequence, when 
cloned and expressed in E. coli is biotinylated by the E coli synthetase (Murtif et al. Proc Nat Acad Sci USA 
82:5617-5621 (1985)). 

30 A polypeptide or part of a polypeptide containing the conserved biotin attachn>ent domain, such as entire 
1.3S (SEQ ID N0:1) protein m the biotin-binding recognition sequence identified within the 1.3S protein from 
Propionibacterium, (SEQ ID NO:2) can be incorporated into a hybrid recombinant polypeptide. Such a hybrid 
polypeptide containing a biotin attachment domain fused to one or more polypeptides of interest could be used 
to achieve the separation of virtually any recombinant protein based upon the affinity of the l^and avklin fcH' 

35 the ligate bbtin. 

Avidin:biotin chromatography shares advantages generally applicable to substrate affinity chromatography 
systems for comnr^ercial-scale polypeptide purification. Substrate-affinity resins are generally inexpensive. Fu- 
sion proteins can be recovered using mild conditions by elution with free ligand. 

Post-translational addition of the biotin prosthetic group is independent of the final folded state of the protein 

40 (Wood et al. J Biol Chem 255:7397-7409 (1980)). an advantage when the host cell performs no post- transla- 
tional modifications on the recombinant polypeptkie. 

A ligand domain such as the donnain directing biotin attachment would therefore be partk)ulariy advanta- 
geous for recovery of fusk>n proteins found in inclusion bodiesorforrecovery of insoluble proteins which require 
denaturants or zwitterionk; detergents for solubilization during extraction, prior to affinity chromatography. 

45 . PCT WO 90/14431, which names Cronan as an inventor, discloses a hybrid DNA sequence encoding a 
fusion protein comprising a first DNA sequence whteh encodes an amino ackt sequence that allows for post- 
translatton nrwdification of the fusion protein; and a second DNA sequence joined end to end with the first DNA 
sequence and in the same reading frame, the second DNA sequence encoding a selected protein or polypep- 
tide. In each of the examples the first DNA sequence is fused to the 3' end of the second DNA sequence (i.e. 

so the first DNA sequence is downstream of the second DNA sequence). 

Also disclosed disclosed in PCT WO 90 / 14431 is a vector comprising the hybrid DNA, a host transformed 
with the vector, a method of producing a fusion protein by culturing the transformed host under conditions per- 
mitting exF^ession of the fusion protein, a fiiston protein comprising a selected protein or pdypeptide linked to 
an amino add sequence that allows for post-translation modification of the fusion protein and a method of iso- 

55 lating the fusion protein comprising providing a binding partner that binds to the fusion protein only after It has 
been modified, contecting the mcxJified fusion protein with the binding partner under conditions permitting bind- 
ing, separating the nrKxjified fusion protein tx)und to the binding partner firom unbound materials in the mbcture, 
and eluting the modified fusion protein. 
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The work of Cronan ts also disclosed In J Bid Chem 265:10327-10333 (1990) (Cronan) wherein a recom- 
binant DNA plasmtd from E. coli (Murtif et al. Proc Nat Acad Sci USA 82:5617-5621 (1985)) was used to con- 
struct fusion genes containing segments of the 1.3S gene, which contain the biotin attachment domain. 

Cronan (ibid) demonstrated that 1.3S sequences can be used to specifically label proteins in vivo, and to 
s purify proteins from crude cell lysates by avidin affinity chromatography. 

As in each of the examples of PCT WO 90/1 4431 . Cronan's (ibid) chimeric genes were constructed by fus- 
ing the 3' end of the genes of interest to the 5' end of the 1.3S gene (i.e. the 1.3S gene is downstream of the 
gene of interest), yielding hybrid recombinant polypeptides having the pdypeptides of interest fused to the N- 
teiminus of the 1.3S polypeptide. 
10 The PCT WO 90/1 4431 and Cronan fusions are consistent with the teachings of Murtif and Samols (J Biol 
Chem 262:11813-11816 (1987)) who teach the fusion of the 3' end of the gene of Interest to the 5' end of the 
1 .3S gene (the N-temninus of the 1 .3S polypeptide) to avoid interfering with the attachment of biotin to Its binding 
domain. Murtif and Samols (ibid) teach that the conformation of the COOH terminus of the 1 .3 S polypeptide, 
and the spatial relationship between this region and a lysine residue positioned exactly 35 residues from the 
15 COOH terminus position to which biotin is attached in vivo, are essential for proper enzymatic recognition and 
biotinylation of the 1.38 polypeptide. 

Murtif and Samols (Ibid) further teach that the conformation of the cartmxyi temiinal region of the 1 .38 poly- 
peptide Is critical for biotinylation, and that altering the hydrophobicity of the carboxyl terminal region of the 1.3S 
polypeptide "eliminates biotinylation." 
20 Murtif Samols did observe biotinylation of 1.3S polypeptides, each lengthened by two amino acids at the 

1.3S cartwxyl terminus. However, such additions of two amino acids to the C-terminus did not substantially 
change its hydrophobicity and such small additions would not be expected to change the conformation of the 
C-terminus. 

US-A-47821 37 discloses a series of recombinant DNA techniques for preparing a hybrid polypeptide con- 

25 sisting of an identification peptide and a desired functional protein. The identification peptide has an antigenic 
terminal and a cleavable linking portion disposed between the antigenic terminal portion and the protein mo^ 
ecule. The linked{linking portion of the klentification peptide is cleavable at a specific amino ackJ residue ad- 
jacent the functional protein by use of a sequence specifK; proteolytic enzyme or chemical agent When the 
protein is cleaved from the isolated hybrid polypeptide, mature functional protein in a highly purified and active 

30 state is released. As with Murtif and Sannols (ibid), PCT WO 90/14431 and Cronan (ibid), US-A-47821 37 only 
discloses the use of a desired functional protein "upstream" of an identification protein. 

US-A-4839293 disloses a process for preparing a fused gene consisting of a streptevidin gene fused to a 
gene encoding the human LDL receptor. Methods are also disclosed that utilise the fused gene to produce lab- 
elled, chemically niodified proteins in viw and also to isolate a protein knowing only tiie nucleotide sequence 

35 of tiie gene encoding the protein. The fused gene comprises a ftrst DNA fragment encoding a target protein of 
interest fused to a second DNA fragment encoding streptavidin which has a multiplicity of binding sites for biotin 
or a bk>tin derivative. The fused gene of US-A-4839293 is capable of expressing a fused protein in vivo when 
the gene is inserted Into a suiteble expression vectOTand introduced into a suitable host cell. The fusion proteins 
are separated by the addition of btotin. Apparently, this method overcomes the diadvantages associated with 

40 the then-known commercial preparations of streptavidin as it utilises a biotin contaminant-free source of strep- 
tevidin which has all four valencies free for bntin binding. 

However, as with Murtif and Samols (ibid), PCT WO 90/14431, Cronan (ibid) and US-A-47821 37, US-A- 
4839293 only discloses the use of a desired functional protein "upstream" of an identification protein. 

The problems associated with the prior art methods can therefore be summarised as follows. First, they 

45 do not allow proteins and the like to be isolated in a high level of purity. Second, tiiey do not allow proteins and 
the like to be isolated in high yields. Third, they do not allow proteins and the like to be separated in high purity 
and yield in a single chromatographic step. Fourthly, they do not provide for the alteration of the carboxyl ter- 
minal region biotin-brnding polypeptide part of the hybrid polypeptides. In fact, and as discussed at>ove, the 
prior art teachings (such as those of Murtif and Samols (ibid), PCT WO 90/14431 and Cronan Obid)) deariy 

50 suggest that altering, tor example, the hydrophobicity of the cariMxyl terminal regton of the 1.3S polypeptide 
would eliminate biotinylation and therefore prevent the isolation of the hybrid polypeptide. 

The sdution to these problems, which the present invention provides, is a recombinant hybrid polypeptide 
(including methods and nudeic ackl sequences for produdng same) comprising a polypeptide of interest fused 
to an avki in-binding polypeptide conteining a biotin attechment domain wherein the pdypeptide of interest is 

55 fused to the C terminus of the avidin-binding pdypeptide (i.e. the pdypeptide of interest (i.e the desired func- 
tional protein) is "downstream" of the avidin-binding polypeptide (i.e. the klentification protein)). 

Thus, according to a first aspect of the present invention there is provided a recombinant hybrid polypeptide 
comprising a polypeptide of interest fused to an avkJ in-binding pdypeptide conteining a biotin attechment do- 
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main, characterised in that the polypeptide of interest is liised to the C tenmlnus of the avidin-binding polypep- 
tide. 

Preferably, biotin is attached to the avidin-binding pdypeptide. 

Preferably, the polypeptide includes a deavage site for cleaving the polypeptkJe of interest from ttie avidin- 
5 binding polypeptide. This cleavage site may be between the polypeptide of interest and the avidin-binding poly- 
peptide or it nr>ay be integral with either or both of the polypeptide of interest and the avidin-binding polypeptide. 

Preferably, the cleavage site is aspartic acid-proline, asparagine-glycine, methionine, cysteine, lysine-pro- 
line, arginine-proline, lysine-arginine or isoleucine-glutamic acid-glycine-arginine. 

Preferably, the avidin binding polypeptide is, or Is part of, a 1.3S polypeptide. Preferably, the 1.3S poly- 
10 peptide is from Proptonibacterium. 

Preferably, the biotin attachment domain of the avidin-binding polypeptide comprises at least one of the 
sequence: 

15 Pro Ala Pro Leu Ala Qy Thr Val Ser Lys De Leu Val Lys Glu 

Gly Asp Thr Val Lys Ala Gly Gin Thr Val Leu Val Leu Glu 
Ala Met Lys Met Glu Thr Qu lie Asn Ala Pro Thr Asp Gly. 

20 

Preferat)ly, the avidin-binding polypeptide comprises a plurality of non-contiguous and / or contiguous avi- 
din-binding polypeptides, which may be the same or different 

Preferably, the polypeptide of interest comprtees a plurality of non-contiguous and /or contiguous polypep- 
tides of interest, which may be the same or different 
25 Preferably, the polypeptide of interest is an enzyme, is an antigen useful far vaccine production, or a di- 
agnostic reagent 

Preferably, the polypeptide of Interest has antitumor activity or has an amino acid sequence for recognition 
of antigens. 

According to a second aspect of the present invention there is iMDvided a nucleic acid sequence coding 
30 for a hybrid polypeptide comprising a polypeptide of interest and an avidin-binding polypeptide containing a 
biotin attachment domain, characterised in that the nucleic acid sequent^ coding for ttie polypeptide of interest 
is downstream of the nucleic acid sequence coding for the avidin-binding polypeptide. 

Preferably, ttie nucleic acid sequence is a DNA sequence. 

Preferably, the DNA sequence contains in a 5' to 3' direction on the coding strand a gene comprising a 5' 
35 promoter region, a DNA sequence coding for the avidin-binding polypeptide and the DNA sequence coding for 
a polypeptide of interest 

Preferably, the DNA sequence is, or is part of, an expression vec^ or a plasmid. 
According to a third aspect of the present invention there is provided a process for the production of a hybrid 
polypeptide according to the first aspect of the present invention comprising constructing a plasmid containing 
40 a nucleic acid sequent^ according to the second aspect of the present invention, transfonning the plasmid into 
a procaryotic or eucaryotic host cell expression system, exfX'essing the system, contacting the hybrid polypep- 
tide resulting from the expression system with avidin, and harvesting the resulting avidin-bound hybrid poly- 
peptide. 

Preferably, the expH^ssion system Is either E. Coli or insect cells. 
45 According to a fourth aspect of the present invention there is provided a process for the isolation of a hybrid 

polypeptide according to the first aspect of the present invention comprising contacting the hybrid polypeptide 
with avidin. 

Preferably, ttie avidin is monomeric avidin, tetrameric avidin orstreptavidin. 

Preferably, ttie polypeptide of interest is cleaved from the isolated hybrid polypeptide. 
50 Preferably, the hybrid polypeptide is isolated using avidin covalentiy bound to a chemically inert, solid, wa- 

ter and solvent insoluble substrate through a chemically stable non-hydrolyzable linking group. 

Preferably the hybrid polypeptide is Isolated using avidin monomer affinity chromatography. Preferably, the 
hybrid polypeptide is isolated using avidin monomer affinity chromatography disclosed in EP-A-0414785 
(90310154.1). 

55 EP-A-04147B5 discloses a monomeric avidin polypeptide ligand and a novel and particularty efficacious 
process ^or isolating synttietic or natural molecules and / or biotinylated derivatives thereof, by adsorbtion of 
ttie nmlecules of interest onto a novel affinity media which contains avidin fixed to a solid inert support 

According to a frflh aspect of the present invention there is provided a first Icit comprising a hybrid polypep- 
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tide according to the first aspect of the present invention and avidin. 

According to a sixth aspect of the present invention there Is provided a second kit comprising a nucleic 
acid sequence according to the second aspect of the present invention and avidin. 

According to a seventh aspect of the present invention there is provided a third kit comprising a nucleic 
5 acid sequence which codes for an avidin-binding polypeptide containing a biotin attachment domain and which 
is fusable to a nudeic acid sequence coding for a polypeptide of interest in order to form a hytMid nucleic acid 
sequence according to the second aspect of the present invention and avidin. 

Preferably, the third Icit comprises means to fuse the nucleic acid sequence coding for the avidin-binding 
polypeptide to the nucleic acid sequence coding for the polypeptide of interest in order to form the nudeic acid 
10 sequence according to the second aspect of the present invention. 

Preferably, any one of the kits comprises means to deave the polypeptkle of interestfirom the avidirhbinding 
polypeptkje. 

In accordance with the present inventkin, therefore, a hybrid polypeptkje comprising one or more polypep- 
tides of interest and at least one pdypeptide for attachment is produced by recombinant DNA methods. 
15 The present invention also [vovkles a process for produdng this hybrid polypeptkle in a procaryotic or a 
eucaryotk: protein expressnn system. 

One of the advantages of the present invention is that it provides a means for Isolating a hyt»id polypeptide 
In high purity and yield. 

Another advantage is that the hybrid polypeptide according to the present invention can t>e recovered in 
20 high purity and high yield In a single chromatographic step, such as the avidin monomer affinity chromatography 
technique disdosed in EP-A-0414785. 

Another advantage of the present Invention resides in the fusk)n of the polypeptide of Interest to the C- 
temninus, and not to the N-terminus, of the polypeptide containing the bntin domain to avidin (e.g. a 1.3S poly- 
peptide). 

25 in this regard, the protein expressk>n level in a host cell is detemilned by a number of factors, induding 

promoter strength and optimal initetton of protein translation (see commentery above). Promoter strength con- 
tributes to the efficiency of transcription of messenger RNA. Optimization of the processes Involved In the inn 
tiation of translatk>n is importent to achieving high levels of protein expression in the host cell. When polypep- 
tides of Interest are Introduced at the 3' terminus of the gene coding for the polypeptide conteining the biotin 

30 domain to avidin (e.g. a 1 .3S gene), no change is made to the optimal placement of the 5' temninus of the 1 .3S 
gene directly adjacent to the promoter and 5' regulatory sequences. Thus, maximal expression levels in host 
cells can be achieved. 

In contrast to this, and as found with the prior art methods, if the polypeptide of interest is inserted between 
the promoter and 5' tenminus of, for example, the 1.3S gens additional expemnination and tailoring is required 

35 to achieve maximal expressk>n levels in host cells. 

The fusing the polypeptide of Interest to ttie C terminus of the polypeptide conteining the bntin domain to 
avidin (e.g. a 1.3S polypeptkle or a fragment of the 1.3S polypeptide), so that the correct (x>nformatk}n of the 
biotin attachment regton may be preserved, is thus in direct contrast to the prior art methods (such as those 
disdosed in Murtif and Sannols (ibid). PCT WO 90/14431. Cronan (ibkl). US-A-4782137 and US-A4839293, 

40 which in fact teach away from the present invention). 

Moreover, it was surprising to find that if one went against the teachings of the [Mior art methods (such as 
those disclosed in Murtif and Samds (ibid), PCT WO 90/14431. Cronan (ibid). US-A-4782137 and US-A- 
4839293) and fused a polypeptide of interest to the C-terminus of the polypeptide containing the biotin dontain 
to avidin, such as a 1.3S polypeptide, the appropriate lysine reskJue of the 1.3S polypeptide within the hybrid 

45 was indeed biotinylated. This is surprising since, if one were to follow the eariier teachings of Murtif and Samols. 
the addition of a polypeptide substantially longer than two amino add resktues woukl t>e expected to alter the 
conformation of the C terminus of the 1.3S polypeptide and thus preclude btotinylation. 

It is also surprising to find that, contrary to the teachings of Cronan (ibid) and Murtif and Samols (ibid), when 
a polypeptide of interest is fused to the polypeptide containing the biotin domain to avklin, such as a 1.3S poly- 

50 peptide, at the C-terminus of the 1.3S polypeptide, bk>tin is attached to the biotin-attachment domain of 1.3S 
polypeptkle within this hybrid recombinant polypeptide. In tiiis regard, Cronan (ibid) and Murtif and Samols (ibid) 
teach that fuston at the C-terminus of the 1 .3S may disrupt the native hydrophobicity and thus the native con- 
fonnation of the 1.3S polypeptide, thus inhibiting bk)tinylation, and consequentiy inhibiting the binding of hybrid 
polypeptides to avidin. 

55 It is further surprising to find that the btotin group attached to the polypeptide containing the btotin domain 

to avklin, such as a 1.3S peptide, fused at its C-terminus to another polypeptide is positioned so as to make 
the biotin molecule avaQable for binding the hybrid polypeptide to the avidin nwnomer affinity resin of EP-A- 
0414785. 
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Furthermore, the hybrid polypeptide is selectively retained by the avidin resin and can be recovered in high 
yield and high purity. 

In the present invention, a binding domain, or recognition sequence, directs the attachment of biotin to the 
hybrid polypeptide. The biotinylated hytnld polypeptide can then be specifically selected by affinity ligand com- 
5 positions such as the avidin monomer resin of EP-A-0414785. 

The present Invention therefore yields a single purirK;ation step that can separate the protein of interest 
from complex mixtures, such as crude bacterial lysates, with high levels of recovery in a single chromatographic 
step, thus alleviating tiie recovery problems inherent to multistep purification processes. Such a combination 
of hybrid polypeptide and avidin monomer affinity resin would clearly confer significant advantages to the pur- 
10 ification of commercially useful polypeptides over existing processes. 

An avidin-binding polypeptide for attachment is generally a polypeptide that enables the attachment of a 
hybrid fusion polypeptkJe to avidin. Among such polypeptides for attachment that may be used are those con- 
taining a recognition sequence for attachment of the prosthetic group biotin, such as the 1 .3S polypeptide sub- 
unit of transcarboxylase from F^pionibacterium. 
15 In addition to using the entire sequence of the 1 .3S polypeptide as the polypeptide for attachment, other 
smaller portions of the 1 .3S polypeptide may be used which direct the attachment of biotin. particularly portions 
comprising all or part of amino acid residues 58 through 100 (SEQ ID NO: 2). 

One or more deletions, substitutions, insertions or mutations may be made by methods well known in the 
art which result in a biotinylated 1.3S polypeptide or biotinylated fragment The nucleotide sequence coding 
20 for the 1.3S polypeptide or filaments may be synthesized using a commercially availabie DNA synthesizer in 
a manner well known in the art 

Additionally, other polypeptides or portions thereof that are enzymatk^aly biotinylated may also be errv 
ployed. 

Unless indicated otherwise, the term "avidin" includes streptavidin. 
25 The polypeptide of interest may include two or nrare polypeptides of interest 
The two or more polypeptides of interest may be fused sequentially. 
Optionally, contiguous polypeptides of interest may be fused sequentially. 

Additionally, more than one polypeptide of interest may be present in a noncontiguous arrangement, for 
example, one polypeptide of interest may be fused to the N-temninus of the polypeptide for attachment and one 
30 polypeptide fused at the G-tenminus of the same polypeptide for attachment 

Two polypeptides of interest may be fused to the C-termini of two polypeptides for attachnnent arranged 
as: polypeptide for attachment 1 - polypeptide of interest 1 - polypeptide for attachment 2 - polypeptide of in- 
terest 2. 

In any of the arrangements disclosed here, the polypeptides of interest may be the same, or different The 
35 polypeptides for attachment may be the same, or different 

An advantage of fusing a plurality of polypeptides of interest to at least one polypeptide for attachment is 
ttie ability to increase ttie yield of a single polypeptide of interest present by including two copies of that poly- 
peptide within a single hybrid pdypeptide, and/or to increase the number of polypeptide spedes that can be 
purified simultaneously by a single avidin affinity chromatography step, if the polypeptides of interest are dif- 
40 lerent 

A cleavage amino ackl or sequence of amino acids may be present between the polypeptide containing 
the biotin dorrain to avidin and the polypeptide of interest 

Likewise, a cleavage amino acid or sequence of amino acids may be present between the polypeptides of 
interest if two or nu>re of such polypeptides are present Such linking, or cleavage, amino acid(s) penmite the 
45 separation of polypeptides at a specific site or sites on the hybrid polypeptide when it is treated with the ap- 
propriate chemical reagent or enzyme. If desired, the cleavage site is positktned adjacent the polypeptide of 
interest so that the polypeptide of interest may be deaved from the polypeptide for attachment. 

The hytHid polypeptide itself may contain a linking amino add or amino ackls for deaving the polypeptide 
or pdypeptides of interest from the polypeptide or polypeptides for attechment The linking amino ackJ or amino 
50 adds are Incorporated between the polypeptide or polypeptides for attachment and the polypeptide or poly- 
peptides of interest in such a way that one or more deavage reactions separate each polypeptide spedes to 
the degree necessary for intended applications. It may not in every instance be necessary to deave all, some, 
or any of the species within a particular hybrid polypeptide. 

Amino adds that may be used to linked{the polypeptide of interest to the polypeptide ftu- attachment indude 
55 aspartic add-proline. asparagine-glycine, methionine, cysteine, lysine-proline, arginine-proline, isoleudne-glu- 
tamic add-glycine-arginine, and the like. 

The at least one polypeptide tor attechment may be deaved from the at least one polypeptide of interest 
by exposure to the appropriate chemical reagent or deaving enzyme. 
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It should be recognized that cleavage of the polypeptide or polypeptides of Interest from the polypeptide 
or polypeptides for attachment may not be necessary for every hybrid fusion polypeptide that Is constnicted, 
in which case a cleavage site could be incorporated, or absent 

The polypeptide of interest can comprise substantially any procaryotic or eucaryotic polypeptide that can 
5 be expressed by a vector in a host cell. Among the polypeptides of interest which may be produced by such 
n^ns are enzymes, such as proteases, oxidoreductases, transferases, hydrolases, lyases, isomerases or lig- 
ases. 

The present invention also contemplates the production of storage polypeptides, such as ferritin or oval- 
bumin or transport polypeptides, such as hemoglobin, serum albumin, eruloplasmin, or the like. Also included 
10 are the types of polypeptides that function in contractile and motile systems, for example actin and myosin or 
the like. 

The present inventkin also contemplates the productton of polypeptides that serve a protective or defense 
function, such as the blood polypeptides thrombin and fibrinogen. 

Other protective pdypeptides include the binding polypeptides, such as antibodies or immunoglobulins that 
15 bind to and thus neutralize antigens. Additnnally this inventk>n contemplates Protein A. or the like. 

The polypeptide produced by the present invention also may encompass various hormones such as en- 
dorphins, human growth hormone, somatostatin, prolactin, estrogen, progesterone, thiyotropin, calcitonin, go- 
nadotropin, insulin or the like. 

Other such hormones include those that have been identified as being involved in ttie immune system, such 
20 as interteukin 1. interieukin 2, colony stimulating factor, nnacrophage-activating factor, interferon, or the like. 

The present invention be used to produce toxic polypeptides, such as ricin from castor bean or gos- 
sypin from cotton seed, and the like. 

Polypeptides that serve as structural elements may be produced by the present inventkin. such polypep- 
tides include the fit)rous polypeptides collagen, elastin and alpha-keratin. 
25 Other structural pdypeptides include glycoproteins, virus- proteins, muoo- proteins and the like. 

Polypeptides that may be utilized as diagnostic agents, for example as markers for the presence of certain 
diseases, are also contemplated by this invention. 

Additional polypeptides of interest that may be produced as hybrid polypeptides are polypeptides that may 
be used for therapeutic purposes, for example polypeptkJes with anti-tunrior activity, polypeptides useful in vac- 
30 dne production, polypeptides having amino acid sequences for recognition of antigens, or polypeptides which 
can function as diagnostic reagents, and the like. 

In addition to the above-noted naturally occurring polypeptides, the present invention may t>e used to pro- 
duce synthetic polypeptides, defined generally as any sequence of amino acids not occurring in nature. 

Preferably, the hybrkj polypeptide is produced in procaryotic or eucaryotic cells transformed by a cloning 
35 vector comprising a nucleic ackd sequence according to the present invention. The hybrid polypeptide is then 
purified away from tite complex cell extract mixture by avidin affinity chromatography. A partrcularty prsfen'ed 
fonm of avidin is avidin monomer. 

In a prefered embodiment, an extract of transformed cells is made from cell culture or fermentation t»roth, 
the hytirid polypeptide is then rendered to a soluble state, and the extract is then applied to the avidin monomer 
40 column. The column is then washed with adequate amounte of a wash buffer to dear the column of unbound 
materials. The hybrid polypeptide is then eluted from tiie cxdumn. 

After the hybrid polypeptide is eluted from the column, the polypeptide tor attachment may optionally be 
deaved from the polypeptide of interest with the appropriate deavage reagent or enzyme. Passage of the 
deaved mixture over the avidin monomer column yields a highly purified preparation of the polypeptide of in- 
45 terest, and the p>olypeptide for attachment is then retained by the column. 

The avidin used to bind the bntin attached to the hybrid polypeptide nnay be monomeric or tetrameric avidin, 
or streptevidin. Avidin monomer is the preferred form of avidin affinity medium. 

Advanteges of using avidin monomer to operate the polypeptide of interest from crude cell mixtures in- 
dude reversible binding of the polypeptide for attechment to avidin, high yield, and high purity of the desired 
50 pdypeptide of interest follow! ng affin ity chronnatography. 

The genes coding for hybrkl pdypeptides may be produced by recombinant DNA methods by combining 
within a DNA expression vector a chimeric gene comprising a 5' pronwter region. DNA sequences coding for 
at least one polypeptide tor attechment of a prosthetic group iot binding to avklin, and at least one DNA se- 
quence coding for a polypeptide of interest 
55 Optionally, the chimeric gene nr\ay contein at least one DNA sequence coding for a linking amino add or 
amino adds, that is. one or more amino adds for deaving a polypeptide of interest from a pdypeptide for at- 
techment 

Genes coding for the various types of polypeptides of interest, for example those identified above, may be 
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obtained Ircan a variefy of procaryotic or eucaryotic sources, such as plant or animal cells or t>acterial cells. 
Genes can be isolated from chromosomal material of eucaryotic or procaryotic cells, or from plasmids or viruses 
of procaryotic or eucaryotic cells by employing standard, well-known techniques. 

Additionally, automated DMA synthesis may be used to obtain DNA coding for naturally-occurring or syn- 
5 thetic polypeptides. To enable chimeric gene expression in host cells, a variety of naturally-occurring and syn- 
thesized DNA expression vectors having genes coding for many different polypeptide molecules are nowcorrv 
mercially available from a variety of sources. 

The desired DNA can also be produced from mRNA by using the enzyme reverse transcriptase. This en- 
zyme permits the synthesis of DNA from an RNA template. 
10 In accordance with the present invention, once genes coding for one or more desired polypeptides of in- 
terest are isolated, synthesized or otherwise obteined, said gene or genes are joined to at least one gene coding 
for a polypeptide conteining a recognition sequence for attechment of the prosthetic group biotin, thus enabling 
the attechment of the hybrid polypeptide to avidin. 

A gene directing the synthesis of a polypeptide for attechment is generally one coding for a polypeptide 
15 that enables the binding of a hybrid fusion polypeptide to avidin. Antong such genes coding for polypeptides 
for attachment that may be used are those coding for amino acid sequences that direct the attechment of the 
prosthetic group biotin , for example the gene for the 1 .3S polypeptide subunit of transcartwxylase from Pro- 
pionibacterium. 

A gene coding fcH- a polypeptide for attechment may be one that directs the attechment of the prosthetic 
20 group biotin. A particularly biotinylated preferred biotinylated polypeptide for attechment is the 1 .3S subunit of 
transcart>oxylase from Proplonibacterium shermanii (SEQ ID N0:1). Although the gene coding for the entire 
1.3S polypeptide from Propionibacterium shermanii is preferrsd(SEQ ID NO: 4), optionally any gene or gene 
fragnrtent coding for a polypeptide that directs the attechment of biotin may be suiteble. The preparation of gene 
fragments is well known to those skilled in the art 
25 The gene or genes coding fcM* the at least one polypeptide of interest and the gene or genes coding for the 
polypeptide or polypeptides for attechment are preferably treated with the appropriate restriction enzymes, or 
otherwise treated to have cohesive termini to facflttete ligation with ottier elements of the chimeric gene or the 
DNA expression vector. 

The resulting DNA expression vector carrying the chimeric hybrid polypeptide gene is used to transform 
30 the approf»iate procaryotic or eucaryotic host cell. The selection of a DNA expression vector appropriate for 
tiie desired host cell is well known to those skilled in the art 

Following ttie transformatton procedure, the transfcvmed host cells are iscdated and analyzed for expres- 
sion of the hybrid pdypeptide. Those transformante Identified as conteining the hybrid polypeptide are further 
analyzed by restriction enzyme digestk>n, DNA sequencing and other methods for confirming the correctness 
35 of the desired gene, by methods well known to those skilled in the art 

The transfonmante identified as host cells carrying the gene for the desired hybrid polypeptide are ttien mul- 
tiplied in cultore to cause replication of the vector and high-level expressk>n of the hybrid polypeptide that con- 
teins the polype ptkle of interest 

The ctoning vector may be used to transform additionally other strains of compatible hoste for large-scale 
40 production of the hybrid polypeptide. 

Various methods used for obteining genes or gene firagmente, preparing DNA expression vectors, trans- 
forming host cells, expressing hybrki polypeptides in host cells, and identifying those polypeptides are set forth 
by J. Sambrook, E.F. Fritsch, andT. Maniatis, Molecular Cloning, 2nd Edition. Cold Spring hterbor Press, 1989, 
and also by F.M.Ausubel, R. Brent R-E. Kingston, D.M. Moore, J.G. Seidman. J.A Smith, K. Stnihl., Eds. Cur- 
45 rent Protocds in Molecular Biology, Volume 1 . John Wiley and Sons, New York 1989. 

To prepare DNA expression vectors, various doning vectors may be used. A plasm id is preferred. However, 
a cosmid or bacteriophage may be used. If insect, plant or mammalian cells are used as host cells, viruses 
n^y also be used as vectors. DNA expression vectors may be obteined from natural sources or may comprise 
synthetic DNA. The plasmid chosen for a particular expression system should be compatible witii that host to 
so ensure vector replication and polypeptide expression. The plasmid chosen for incorporation of the genes coding 
for the hybrid polypeptide should possess an origin of replication recognized by the host cell. 

The DNA expression vector should contein DNA sequences recognized by restriction endonudease en- 
zymes to deave the vector for subsequent ligation with the gene for the hybrid polypeptide without inactivating 
the origin of replication or functions necessary for plasmid selection following transformation, for example within 
55 an antibbtic resistence gene. The vector should contein restriction enzyme deavage sites that provkle suiteble 
termini for joining and ligatfon of ftveign genes to be Inserted. 

Preferably, the DNA vectOT contains a single site or two unique sites for incorporation of the hybrid poly- 
peptide gene, neither of which occurs within that gene. 

9 



EP0 511 747 A1 



To accommodate more than one different foreign gene possibly temninating in different cohesive or blunt 
tennini, it would be useful for the vector to possess a large number of unique restriction enzyme cleavage sites. 

Preferably, the DNA expression vector will cause a phenotype to be expressed thatwBI enable transformed 
cells to be readily identified and separated from cells which do not undergo transformation. Such phenotypic 
5 sdection genes can include genes providing resistance to a particular antibiotic, which inhibits growth of un- 
transformed but not transfonned cells. Such genes are widely available now and confer resistance to antibiotics 
such as ampiciltin, tetracycline, streptomycin, kanamycin, and the like. 

Plasmids which contain an inserted gene that disrupts the B-galactosidase gene, such as the hybrid poly- 
peptide gene, can be identified following transformation by the inability of the host cell to reduce reagent 5- 
10 bromo-4-chloro-3 indoly)-t>-D-ga)actopyranoside (X-gal) in the media and cause the bacterial cdony to develop 
a blue coloration. Such plasmids. reagents, and media are known to those sidlled in the art 

Preferably, E. coll is employed as the host cell, and a plasmid is prefenred for cloning and transformation 
of the E. cdi host The preferred plasmid is pKiC223-2{Phanmacia, Uppsala, Sweden). This plasmid carries 
genes for an origin of replication in E. cdi, and a gene for resistance to the antibiotic ampicfllin. This plasmid 
15 also has a synthetic linker regk>n consisting of unique restriction endonuclease cleavage sites to facilitate clon- 
ing. This plasmkj contains the strong tac pronvoter, which directs high levels of transcriptk>n in E. cdi. 

If insect cell culture is to t>e used for production of hybrid pdypeptkte, the preferred plasmid is pVL1392 
(obtained from M. Summers, University of Texas, available commercially from Invitrogen, Inc. San Diego CA). 

An advantage of insect cell culture is that polypeptides requiring glycosylation or other types of post tians- 
20 lational nrxxifications induding folding with appropriate disulfide bond fonrtation may be so modified in an insect 
cell expression system, whereas this manner of post-translatkinal modificatton is not performed by procaryotic 
hosts. 

To prepare the chosen plasmid for insertion of the chimeric gene comprising the hybrid pdypeptkle gene, 
the [dasmid is digested with restriction endonudeases, for example BamHI or EroRI, or any restriction enzyme 
25 or enzyme combinatk>n that deaves the plasmid at a u nique site and produces cohesive 3' and 5' termini com- 
plementary to termini of the chimeric gene to be ligated . If desired, the plasmid may be treated mth two diffierent 
ertzynoes to produce two different cohesive termini to fadlitate ligation of the chimeric hybrid polypeptide gene 
or genes in the correct orientation within the plasmid. Certain enzymes which produce blunt ends may also be 
used, or linker molecules may be added to vector or foreign genes to prepare the desired cohesive termini. 
30 Such strategies and methods are well known to those skOled in the art 

When the plasmid is digested, two or more DNA fragments n^y be generated. The desired plasmid firag- 
ment carrying the origin of replication and other genes essential to replication and identrTication of the plasmid 
may be identified and recovered by gel electrophoresis and other techniques well known in the art 

A particulariy preten^d anrangement tor the members of the hybrid polypeptide is the locatnn of the pdy- 
55 peptide for attachment most preferably the 1.3S polypeptide directly 3' to the promoter at the PstI site within 
the synthetk; linker of the plasmid pkk223-3. The Pstt site is 3' to the promoter and to a ribosome binding site. 

It should t>e understood that any delettons, inserttons, substitutwns. or mutetions which may be performed 
on the 1.3S gene which still direct the attechment of biotin are contemF^ated within the spirit and scope of this 
invention. Additionally, other genes or gene fragments, natural or synthetic, whose resulting polypeptides direct 
40 the attachment of btotin or lipoic add fall within the scope and spirit of this invention. 

The 1.3S gene is preferat^y constructed with a deavage site for PstI at rte 5' terminus and a deavage site 
for BamHI at its 3' terminus, such that upon ligation, the 1.3S gene is connected in the proper reading frame 
with the tac promoter, a ribosome binding site is inted, and the 1.3S gene or fragment preferably terminates 
in the nudeotide sequence GAT CCA TAA CGC CTA AGC TT (SEQ ID NO: 3). or any such sequence which 
45 simulteneously provides a BamHI restriction endonudease deavage site and codes for the amino adds asp- 
pro. Asp pro is the preferred sequence used as linking amino adds the deavage of the 1.3S pdypeptide for 
attachment from appropriate polypeptides of interest 

However, if a pdypeptide of interest contains within ite sequence one or more asp-pro sequences, then 
optionally any other linking amino add or amino acids not present in the polypeptide of interest may be sut>- 
50 stituted for asp-pro. It will be necessary to structure the gene in such instances that appropriate cohesive temnini 
are created that pfflmit ligation of the 1 .3S gene to the gene for the pdypeptide of interest 

The gene or genes for the at least one polypeptide of interest may be isolated, synthesized, or otherwise 
obtained and modified at the 5' t^minus so that ligation to the appropriate tenminus of tiie gene for the pdy- 
peptide for attechment is facilitated. The 3' tenninus of the 1.3S polypeptide for attachment is the preferred 
55 tenminus for ligation of the gene for the pdypeptide of Interest in the proper reading frame. 

Furthermore, the 3' terminus of the last pdypeptide of interest in sequence in the chimeric gene should 
preferably be prepared so that this terminus is complementary to the 5' tenminus of the plasmid wecXor, to fa- 
cilitate ligation to the expresston vector. 
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It is to be understood that for different chimeric genes, obtaining the con^ct orientation of the polypeptide 
or polypeptides of interest relative to each other and to the polypeptide or polypeptides for attachment within 
the expression vector employ the basic steps as outlined atx>ve. 

Preferably, the gene for each polypeptide of interest is attached in the proper reading frame to the adjacent 
5 gene, and all adjacent temnini are prepared in a manner so as to make them complementary to facilitate ligation. 

Furtherm(»«, genes for any polypeptides of interest requiring deavage from another polypeptide of interest 
or from one or more polypeptides for attachment must t>e so constructed as to allow for the proper positioning 
of all deavage sites so that their insertion does not result in any genes for any of the polypeptides t)eing in an 
improper reading frame. 

10 The ligation reaction, which covalently joins fragments of DNA, is described in Sambrook, Fritsch, and Man- 
iatis (ibid), and Ausubel et al. (ibid) and is well known to those skDIed in the art 

The ligated plasmid is ready for transformation of host cells. The preferred host is £. cdi, however, other 
bacteria, Insect cells, yeast, or mammalian or plant cells may be used with a DMA expression vector appropriate 
to that particular host cell. 

IS Transformation of E. coli is a standard procedure well known to those skilled in the art, wherein a suitable 
host strain, such as E. cdi HB101 accepts, harbors, replicates and expresses the plasmid carrying the gene 
for the hybrid pol^ptide. Transformation of E. coli is described by Sambrook, et al. If the host is an insect 
cell, transfection may be accomplished by a procedure such as that described by M. Summers and G. Smith, 
A Manual of Methods for Baculovlrus Vectors and Insect Cell Culture Procedures. Texas Agricultural Experi- 

20 ment Station Bulletin No. 155, 1988. 

In order to identify the host cells which are transformed, the culture is placed in selective media containing 
an appropriate antibk>tic Only those cells with plasmid-bome resistance will survive. 

Plasmid can be recovered after lysis of surviving cell colonies, and characterized by restrictnn enzyme 
digestion and mapping, DNA sequencing, or other methods known in the art Additionally, those colonies which 

25 express hybrid polypeptide can be identified by immunological assay, such as ELISA or Westem blotting. In 
some embodiments it may be possible to assay directly for biological activity of the polypeptide or polypeptkJes 
of interest 

Once transfomied cells carrying the hybrid polypeptide are identified, they may t>e multiplied by established 
techniques, such as fermentatnn. In addition, the recovered plasmkis can be used to transfonm other strains 
30 of bacteria, or appropriate hosts cells for large-scale production and expresskin of the hybrid polypeptkie. 

The hybrid polypeptide which contains the polypeptMe for attachment, the biotin group for binding to avidin, 
the polypeptkie of interest and the optional deavage site, expressed by the transfonmed host cells may be 
separated from the medium and ottier debris by affinity chromatography. 

The preferred affinity medium is the avidin monomer resin described in EP-A-0414765. To this end, host 
35 cells are separated from the medium and broken open for example, by sonication. 

Optionally, hybrid polypeptkles can be excreted into tiie culture media if a signal peptkle for extracellular 
secretion is induded at the appropriate terminus of the hybrid polypeptide. 

Should such a secreted polypeptide be desired, it may be necessary to indude a DNA sequence coding 
for a polypeptide directing extracellular secretin within the chimeric gene coding for the hybrid polypeptide. 
40 The hybrid polypeptkie once released is maintained in an appropriate buffer, preferably one in which it is 
soluble. The buffer solution should be formulated to maximize hybrid polypeptide recovery from host cells. Buf- 
fer properties whk;h may be optimized to favor recovery indude but are not limited to, pH. ionic composition, 
bnic strength, or presence or absence of various detergent compositions. 

Optionally some fractioniation of the host cell extract may be performed in order to concentrate or partially 
45 purify the hybrid polypeptide prior to afFinrty chromatography. 

One preferred method is amrmnium sulfate fractionation. It is to be understood that other methods conv 
monly employed in protein purification may also be used prior to affinity chromatography. The cell extract is 
passed over til e preferred column for affinity chromatography, the avidin nwnomer column, which is ttien wash- 
ed extensively with buffer to renf>ove all unbound materials. 
50 The hybrid polypeptide is specifically eluted from the column, preferatrfy with acetic ackJ or biotin. As as 
result a high yield of highly purified hybrid polypeptide containing the polypeptide of interest is obtained. 

It may be desirable or necessary to deave the one or more polypeptide of interest from the one or more 
polypeptkie for attachment to restore bk>logical activity to the polypeptide of interest 

Separation from the polypeptide for attachment may t>e accomplished by first suspending the hybrid pdy- 
55 peptide in buffer. Thereafter ttie chemical or proteolytic deavage agent specific to the linking amino acid or 
amino adds Is added (o the suspension and the polypeptkie of interest is deaved. 

Fc»' example, if the polypeptide of interest is linked to the polypeptide for attachment by an asp-pro linkage, 
a volatile acid such as formic add may be added to ttie suspension to effect deavage. 
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If methbnine is the linking amino add, the reagent cyanogen t}romide may be used to cleave between me- 
thionine and the first amino acid of the polypeptide of interest 

A volatile cleavage reagent, such as formic acid or cyanogen bromide, may t>e evaporated away from the 
polypeptide mixture, tf cleavage is accomplished by an enzyme, the enzyme may be removed from the mixture 
5 by passing the mixture through an enzyme substrate column. 

If it is necessary to obtain the polypeptide of interest pure from the polypeptide for attachment, this removal 
may be accomplished by passing the mixture through an avidin affinity column. In this way, the polypeptide for 
attachment by binding to the avidin, will be retained and therefore separated fircMin the highly purified solution 
of the polypeptide of interest which will not bind to avidin. 
10 It should be noted that some polypeptides of interest will assume their desired biological activity with the 
polypeptide for attachment still attached. As a consequence, the polypeptide for attachment will not need to 
be cleaved from the polypeptide of interest and the steps described to separate the polypeptide of interest and 
the polypeptide for attachment need not be performed. Moreover, in circumstances where the polypeptide for 
attachment remains attached to the polypeptide of interest linking amino acid or amino acids may be present 
IS or omitted. In this situation, the constructk>n and method of preparing the DNA expression vectors. detaOed 
above, can be appropriately modified. 

The present invention will now t>e described by way of examples only. 

Reference shall be made to tite following figures, in which: 

Figure 1 shows a partial restrictk>n n>ap of plasmid ptac1.3dp; 
20 Figure 2 shows chimeric gene constructs for hybrid polypeptides constructed so that the polypeptide of 

interest is fused at the C-tenminus of the polypeptkle for attachment; 

Figure 3 shows chimeric genes for hybrid p<riypeptides containing more than one polypeptkJe of interest 
fused to a single polypeptide for attachment; and 

Figure 4 shows chimeric gene constmcts for a hybrid polypeptide containing two noncontiguous polypep- 
25 tides of interest each fused to the C-terminus of noncontiguous polypeptides for attachment 

Referring to Figure 1, a partial restriction map of plasmid ptac1.3dp is shown. ptad.Sdp was created by 
nnodification of plasmids ptac1.3tand ptac1.3(1-125) obtained from D. Samols, Case Western Reserve Uni- 
versity. An E. Cdi strain CSH26 containing the plasmid ptac1.3dp has been deposited in the American Type 
Culture Collection, Rockville, Md, USA as ATCC No. 68937. This deposit was made pursuant to the Budapest 
30 Treaty On The Intemational Recognition Of The Deposit Of Microorganisms For The Purposes Of Patent Pro- 
cedure. 

Referring to Figure 2, chimeric gene constructs for hybrid polypeptides are shown. The chimeric genes are 
constructed so that the polypeptide of interest is fused at tiie C-terminus of the polypeptide for attachnient In 
these examples, the polypeptide of interest is a synthetic t}-endorphin. and the p>olypeptide for attachment is 
35 the 1 .38 polypeptide from transcartxixylase of f^pionibacterium shermanii. 

In Figure 2A is shown an asp- pro cleavage site bcated between the C-tenminus of the 1.3S polypeptide 
and the N-terminus of the b-endorphin polypeptide. 

In Figure 2B is shown an asp-pro cleavage site located between the C-temninus of the 1.3S polypeptide 
and the N-tenrninus of a novel reverse-endorphin polypeptide. 
40 In Figure 2C is shown a methionine cleavage site located between the C-tenninus of the 1 .3S polypeptide 
and the N-terminus of the b-endorphin polypeptide. 

In Figure 2D, no cleavage site is located t>etween tiie C-terminus of the 1 .38 polypeptkJe and the N-ter- 
minus of the t>-endorphin polypeptide. 

Referring to Figure 3, chinneric genes for hybrid polypeptides are illustrated which contain more tiian one 
45 polypeptide of interest fused to a single polypeptide for attachment 

In Figure 3A is shown the fusk>n of two contiguous t>-endorphin polypeptides to the C-terminus of the 1.38 
polypeptide from transcartwxylase of Propioni bacterium shermanii. 

In Figure 3B is illustrated the fusion of two different noncontiguous polypeptides of interest to the 1 .38 poly- 
peptide. A maltose binding protein is fused to the N-terminus of the 1.38 pdypeptide, and a synthetic b-endor- 
50 phin is fused to the C-terminus of the same 1 .38 polypeptide. 

Figure 30 shows the fusk)n of two different contiguous polypeptides of interest to the N-terminus of the 
1.38 polypeptide. 

The maltose binding protein and a synthetic b-endorphin pdypeptide are fused in tandem to the N-terminus 
of the 1 .38 polypeptide. 

55 Referring to Figure 4, chimeric gene constructs for a hybrid polypeptide containing two noncontiguous poly- 
peptides of interest are shown. Each coding sequence is fused to the C-teminus of a noncontiguous polypep- 
tides for attachment In the specific example, two synthetic b-endorphin polypeptides are eadi (used to the C- 
tenminus of a different 1.38 polypeptide. 
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The fdlowing examples are carried out using one or more of the general procedures set forth below. 
In all of the following examples, restriction endonudeases, ligases, polymerases, and other DMA nxxJifying 
enzymes described in specific experimental steps are used according to the recommendations of the manu- 
facturer of the particular enzyme or reagent used. 
5 Two laboratory manuals, Current Protocols in Molecular Biology (Ausut>el et al 1989)), and Molecular Clon- 

ing A Laboratory Manual 2nd Edition (Sambrook et al, Cold Spring Hartwr Press, Cold Spring Haibor, NY 
(1989)) referenced below, contain supplemental information that may also be useful to one skilled in the art in 
conducting the examples described below. 

10 Procedure I. Restriction Endonudease Digestion of DNA 

Restrictton endonudease digestions using one or more restriction enzymes to digest DNA are generally 
carried out using the protocols set forth in Ausubel etal. Current Protocols in Molecular Biology Volume 1 , Chap- 
ter 3, Unit 3.1. 

15 Restrictton mapping of plasmids is generally carried out using the protocols set forth in Ausubel et al (ibid). 
Unit 3.2. Restriction enzymes are obtained from Promega (Madison Wl) or New England Biolabs (Beverly MA) 
and complete or partial digestion of DNA with specific enzymes are performed generally according to the man- 
ufacturer's recommendations . 

20 Procedure II. Puriflcatnn of DNA Fragments Using Agarose Gel ElectDrphoresls. 

Agarose gel electrophoresis is generally carried out using the protocols set forth in Ausubel et al (ibid). 
Volume I, Chapter 2, Unit2.5A. Separation and isolation of larger (>1 kb) DNA fragments from exdsed gel frag- 
ments is generally carried out as set forth by Ausubel et al (ibid). Chapter 2, Unit 2.6, and separation and iso- 
25 lation of smaller (<1 kb) DNA firagments is carried out as set forth in Ausubel et al (ibid) Chapter 2. Unit 2.7. 
Removal of salts and gel fragments from larger DNA fragments is accomplished using the GeneCtean kit 
(Biol 01 , Ina San Diego, CA) using procedures supplied by the manufacturer, and renuval of salts and gel frag- 
ments from smaller DNA fragments is accomplished using the MerMaid kit (Biol 01, Inc. San Diego, CA) also 
using the procedures reconvnended by the manufacturer. 

30 

Procedure 111. Ligation of DNA Fragments. 

Ligatbn of DNA fragments using T4 DNA Ligase (New England Biolabs Beverly MA) is generally carri^ 
out as in Ausut>el et al (ibid). Unit 3.14. using conditions recommended by the manufacturer. 

55 

Procedure IV. Preparation of Competent E. c6H Cells and Transformation of E. coll. 

Preparation of competent E. coli CSH26 cells using calcium chloride and transfomnatton with DNA expres- 
sion vectors are carried out using the protocds described by Sambrook et al Molecular Cloning A Laboratory 
40 Manual 2nd Edition 1989, Chapter 1. 

Clones carrying plasmids containing DNA insertions are identified by growing cells on L-agar supplemented 
with 100 mg/L ampk:illin. 

Procedure V. Isolation of Plasmld DNA. 

45 

Plasmkl DNA is generally usubel et al (ibid) Chapter 1. Unit 1.7. 
Procedure VI. Preparation of Double-stranded DNA from Synthetic Single-stranded Oligonucleotides. 

so Oligonudeotides are synthesized using standard phosporamidite chemistry (0.2 micromole synthesis). 

Fragments are separated on a 20% potyacrylamide gel under denaturing conditions as described by Ausubel 

et al (ibid) Volume 1, Unit 2. 12 and eluted and desalted as described therein. 

Double-stranded DNA is assembled from upper strand and lower strand pairs of synthetic oligonudeotides 

mth overlapping regions of perfed complimentarity of 15 nucleotides at their 3' ends by heating a mixture of 
55 1 ug of each of the strands at 90''C for 5 minutes, followed by slow cooling to room temperature over a period 

of one to two hours. This short duplex region serves as template and primer for mutually primed synthesis of 

a complete double DNA strand with Sequenase. a T7 DNA polymerase obtained from US Biochemical, using 

protoccds supplied by the manufacturer. 
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Following duplex extension, the DMA double strand is purHied by agarose gel electrophoresis as described 
In Procedure II. 

Procedure VII. Preparation of Crude E. coll Cell Extract 

5 

E. coli host cells hartxiring the plasmid carrying the gene for the hybrid polypeptide are grown to stationary 
phase by overnight incubation in L-broth containing 100 mg/L ampicOlin at 42*'C, 250 RPM in a New Brunswick 
incubator-shaker. 

Cells are collected by centrifugatnn in a 6SA rotor at 5,000 X 6 for 30 minutes at 4°C in a Sorvall RC-5B 
10 centrifuge to pellet the cells. The supernatant is poured off, and the cell pellet is weighed. The cells are resus- 
pended in 1:2 (cell wet weight buffer volume) 100 mM potassium phosphate buffer, pH 6.8-7.2 (or appropriate 
buffer at pH 4.0 to 1 1.0), at 4*'C. 

The cells are then lysed by sonication using a large probe on Fisher sonic dismembranator Model 300 for 
3 one minute cydes at 95% relative output The resulting lysata is centrifoged at 17,500 RPM for 30 minutes 
15 at 4*^. To the resulting supernatant is added 2% (w:v) streptomycin sulfate. After incubatnn for 15 to 30 minutes 
at 4''C, the lysate is centrifuged at 17,500 RPM as above. The resulting supernatant is adjusted to 30% satur- 
atk>n with ammonium sulfate, incubated for 30 minutes at4°C, and centrifuged at 17,500 RPM as above. The 
supernatant is adjusted to 60% saturation with amiTkonium sulfate, incubated for 30 minutes at 4^C, then cen- 
trifuged at 17,500 RPM as above. 
20 The pellet formed by the addition of 60% ammonium sulfate is resuspended in 1 00 mM potassium phos- 
phate buffer, pH 7.0 to yield approximately 80 to 100 n^g/ml total protein, and is centrifuged at 17,500 RPM as 
above. The supematent is temried the crude extract. 

Procedure VIII. Avidin Monomer Affinity Chromatography. 

25 

The crude extract is applied to a 4mm X 5 cm column packed with avklin nwnomer affinity resin (US serial 
no. 414,785) on a LKB HPLC system equipped with two Model 21 50 pumps, Model 21 52 controller, and a Model 
2140 spectral dete<^. Sample absort)ance is nx>nitored at 280 nm. The crude extract is applied in 100 mM 
potassium phosphate buffer, pH 6.8 to 7.2 (or other appropriate buffer at a pH of 4.0 to 1 1.0} at a flow rate of 

30 0.1 ml /min to a column equilibrated writh the same buffer. 

After the sample is loaded, the column is washed with phosphate tniffer at a flow rate of 1 ml Anin until 
absorbance returned to the t>aseline at>sorbance. and all nonbound n^terial is washed from the resin. The col- 
umn is then equilibrated with water, followed by application of 5 ml of 2M NaCI. The column is reequilibrated 
with water. This same NaCI- water wash procedure is repeated four to five times. The samf^e is eluted using 

35 acetic acid or biotin, as deteiled in Procedure IX 

Procedure iX Eiution of the Hybrid Polypeptide from the Avidin Monomer Affinity Resin 

A Bution using acetic acid 

40 

Five ml of 10% glacial acetic add are applied to the column. Eluted hybrid polypeptide is collected until the 
at>sorbance at 280 nm returns to the baseline at>sorbance. 

B. Bution using btotin 

45 

Five ml of 10 mM biotin in 100 mM potassium phosphate buffer, plH 6.5 is applied to the column. Eluted hybrid 
polypeptkle is collected until the absort>ance at 280 nm retoms to the baseline absort>ance. 

Procedure X Cleavage of the Polypeptide of Interest from the Polypeptide for Attachment 

5D 

A Add deavage 

[Reference: London, M. (1977) Methods in Enzynrology 47:145-149.] 

The hybrid polypeptide suspension is adjusted to 70% fonmic add (v/v) and incubated at 40*'C fen- 24 to 48 
55 hours. The mixture is then fireeze-dried. Highly pure polypeptide of interest is obteined by Procedure VIII. 
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B. Cyanogen bromide cleavage of methionine residues [Reference: Gross, E. and B. Witkop (1961) journal 
of American Chemical Society 83:1510-1511.] 

The hybrid polypeptide is dissolved in 70% (v/v) aqueous formic acid at 23**C. A 50 molar excess of cya- 
5 nogen bromide is added in a small volunrw of 70% formic acid, with stimng. 

The mixture is incubated in the dark under nitrogen at 20-25°C for 16 to 24 hours. The mixture is then dOuted 
with 10 volumes of water and fineeze-dried. 

Highly pure polypeptide of interest is obtained by Procedure VIII. 

10 Prt>cedur« XI. Separation of the Polypeptide of Interest from the Polypeptide for Attachment 

The dried polypeptide mixture isresuspended in avidin monomer column loading buffer. 100 mM phosphate 
buffer pH 6.8-7.2, or other appropriate buffer ata pH of between 4,0 and 11.0. Highly pure polypeptide of interest 
is obtained by passing the cleaved polypeptide mixture over the avidin resin using the procedure descrit>ed 
15 above. 

The polypeptide for attachment is retained by the avidin monomer and the polypeptide of intwest is not 
retained. The polypeptide of interest is collected in the column flowthrough. 

Procedure XII. Plasmid Expression Vectors. 

20 

Two plasmids are obtained from D. Samols, Case Westem Reserve University. 

A. Plasmid ptac 1.3L This plasmid contains the DMA sequence coding for the 123 amino acid sequence 
of the 1.3S polypeptide of transcart>oxylase from Propionibacterium shermanii (SEQ ID N0:2). The DNA 
coding for the 1 .3S polypeptide is cloned as a 431 base pair fragment Into the polylinker regton of the ex- 

25 pression vector pKK223-3 as described by Murtif et al. (Proc Nat Acad Sci USA 82:5617-5621 (1 985)). 

B. Plasmid ptac1.3(1-125). The plasmkl ptac 1.3(1-125) is described by Murtif and Samols, J Bid Chem 
262:11813-11815 (1987). Like ptac1.3t, ptac1.3(1-125) also contains the 1.3S polypeptide but In addition 
has the sequence: 

GAT CXIA TAA CGC CTA AGC TT (SEQ ID NCh3) 

30 

at the 3' end of the 1.3S gene that encodes a BamHI restrictnn endonudease site. This DNA additional se- 
quence codes for the linking amino acid sequence asp-pro at the carboxyl terminus of the 1.3S polypeptide. 
In order to illustrate the nabjre of this invention and the manner of practicing the same, the following ex- 
35 amples are presented. 

Example 1. Modification of ptaci .3(1-125) to increase hybrid polypeptMe expression levels in E. coli. 

In order to increase the expression level of hybrid polypeptides produced from chimeric genes inserted into 
40 ptaci .3(1-125) from approximately 0. 1 % of total soluble cellular protein to approximately 5.0% of total soluble 
cell protein, ptaci .3(1-125) was nwdified as follows. ptac1.3(1-125) was digested with the restriction enzymes 
Xhol and Hindlll. The desired 131 base pair (bp) fragment was obtained by agarose gel electrophoresis. 

The vector ptac1.3t was also digested with Xhol and Hindlll using the conditions described above, and the 
4.86 kilobase (kb) fragment was obtained by agarose gel purificatk>n. The plasmid ptaci .3dp (Figure 1) was 
45 obtained by ligation of the 131 bp fragntent from ptac 1.3(1 -125) to the 4.86 kb fragment of ptaci. 3L 

The ligated plasmid mixture was used to transform competent E. coll HB101. An E coli done harboring 
ptaci .3dp was identified by restriction enzyme digestion of plasmids isolated from selected amplcill in-resistant 
E. coli cells. 

50 Example 2. Fusfon of a polypeptide of interest to the C-terminus of a polypeptide for attachment vnth an 
add deavage site between the polypeptides 

This exam^^e describes a hybrid polypeptide in which a synthetic t>-endorphin polypeptide is fused to the 
cartx)xyl terminus of the 1.3S polypeptide. An asp-pro cleavage site is incorporated between the two polypep- 
55 tides for deavage and subsequent puriTtcation of b-endorphin away from the 1 .3S polypeptide after affinity pur- 
ification using avidin monomer resin. 

The amino acid sequence of a modified b-endorphin polypeptide is shown in SEQ ID N0:6 and the con-e- 
sponding nudeotkle sequence coding for this amino acid sequence is shown in SEQ ID N0:7. 
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The synthetic oligonudeotides from which this synthetic gene was assembled are shown in SEQ ID NO:8 
(RHcbel) and SEQ ID N0:9 (RHcbe2). There are no internal methionine residues in the nrKKlified b-endorphin 
pdypeptide. 

The BamHI endonudease deavage recognition sequence GGATCC at nudeotide positions 12 through 17 
5 at the 5' end of SEQ ID N0:6 allows the introdudon of an ATG codon at the 5' terminus of the b-endorphin 
gene when this fragment is introduced into the BamHI site of ptac1.3dp (SEQ ID N0:5), thereby adding a m&- 
thionine at the N-terminus of the polypeptide. 

To maximize expression in E. coli. the amino acid sequence of authentic b-endorphin was reverse trans- 
lated into a DNA sequence using preferred codon usages of highly expressed E. coli genes (DeBoer, H Chapter 
10 8. Maximizing Gene Expression, W. Reznikoff and L. Gold, Eds.). 

RHcbel and RHcbe2 were synthesized, annealed and filled by T7 DNA polymerase as described in Pro- 
cedure VII. 

The resulting double stranded DNA sequence (SEQ ID NO:5) was digested with BamHI , gen^^ng 105 
bp fragment coding for the synthetic b-endorphin which was purified by agarose gel electrophoresis. 
15 The plasmid vector pUC19 (Sambrook et al. Molecular Qoning, A Laboratory Manual, 2nd Editton, Vol 1, 
1989. p. 1.13) was linearized with BamHI. The 5' tenninus of the linearized plasmid was dephosphorylated prior 
to ligation by incubating the digest mbcture with calf intestinal phosphatase using the protocol described by Sanv 
brook et al (ibid, Vol.1, pp. 3.38-3.39), to minimize self-ligation of the vector. 

Pure linear plasmid was recovered by agarose gel electrophoresis, and the 1 10 bp synthetic b-endorphin 
20 gene fragment was ligated to the pUC19 plasmid, and this Irgated plasmid was used to transform competent 
E. coliHBIOI. 

Following plasmid isolation from am picill in-resistant dones, the recombinant E. coli cells harboring the cor- 
rect plasmid were identified by restriction enzyme digestion. This recombinant plasmid containing the gene for 
the synthetic b-endorphin was designated pUC19end(MS3. 

25 

Cloning of synthetic b-ondorphin into ptad.Sdp. 

The b endorphin gene in pUC19endor63 was fused to the 3' tenminus of the 1.3S gene in ptac1.3dp as 
follows: a 105 bp b-endorphin gene fragment was generated by digestion of pUC19endorB3 with BamHI , and 

30 purified by agarose gel electrophoresis. 

The vector ptac1.3dp, which contains two BamHI sites, was partially digested with BamHI . Plasmkj DNA 
cut at only one BamHI site was purified by agarose gel electrophoresis. 

The 105 bp b-endorphin gene was ligated Into the BamHI site of the linearized ptac1.3dp [dasmid and the 
ligation mixture was used to transform competent E. coli HB101. 

35 The ligated plasmid containing the endorphin gene in the proper orientetlon was identified by restriction 
enzyme analysis of plasmids isolated from ampldllin-resistant transformed E. coli and was designated 
ptaci .3dp:endorB3 (Figure 2A). This plasmid codes for a hybrid fusion polypetide consisting of the 1 .3S poly- 
peptide fused at Its cartxixyl tenninus to an asp-pro deavage sequence fused to a synthetic b-endorphin poly- 
peptide containing a nr>ethk>nine residue at positk>n 1. 

40 A highly pure preparatk>n of the synthetic b-endorphin was obtained by inoculation of L- broth containing 

100 mg/l ampidllin with the E. coli host hartwring ptaci. 3dp:endorB3. A crude protein extract containing the 
1.3S:b-end<Mphin hybrid polypeptide was obtained by fdlowing Procedure Vll. Highly pure 1.3S:b-endorphin 
polypeptide was obteined by avidin monom^ affinity chromatography described in Proc^edure Vlli, using acotic 
acid to elute the purified hybrid polypeptkle from the resin (Prcxodure IX A). 

45 Cleavage of b-endorphin from the 1 .3S polypeptide was accomplished by incubation in formic add accxml- 
ing to Part A of Procedure X, and highly pure b-endorphin was obteined by avidin monomer affinity chroma- 
tography of the deavage mixture by repeating Procodure XI. Following add deavage of an asp-pro linking se- 
quence, a proline residue remains at the N-terminus of the deaved b-endorphin polypeptide. 

Clones contetning the b-endorphin gene fragment inserted in the opposite orientetion from 

so ptad .3dp:endor63 yielded a 1 .38 polypeptide fused to a novel 21 amino add reverse endorphin peptide join^ 
by the linking amino add sequence asp-pro. The gene designated pted .3dp:revendorB3 that encodes this nov- 
el peptide is shown in Figure 2 B. This polypeptide could be purified by avidin monomer chromatography (Pro- 
cedure VIII), eluted in high yidd and high purity from the column using acetic add (Procedure IX A). 

This example further denonstrates production of a hybrid polypeptide conteining a polypeptide for binding 

55 to avidin as an efFicadous method for obteining polypeptides of interest in high yield and high purity. 
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Example 3. Fusion of a polypeptide of interest to a polypeptide for attachment with a methionine deavage 
site between the two polypeptides. 

In this example, the synthetic b-endorphin polypeptide is fused at its N-tenminus to a methionine residue, 
5 this nrtethionine residue being positioned at the C-terminus of the 1.3S polypeptide, thus providing a single ami- 
no acid cleavage site for separation of b-endorphin from the 1.3S polypeptide following avidin nwnomer chro- 
matography. 

Cleavage with cyanogen bromide yields an unmodified N-temiinus on b-endorphin, as the methionine is 
cleaved from the N-terminus of b-endorphin. 

10 The vector ptac1.3dp is digested to completion with Xhol and HIndlll, and the 131 bp fragment is purified 
by agarose gel electrophoresis. This 1 31 bp fragment is subjected to partial digestion with Sau3A, and the 1 1 0 
bp fragment so generated is isolated and purified by agarose gel electroph<»esis. This Xhol-Sau3A fragnferir" 
isllgated to a double-stranded synthetic DNA fragment (SEQ ID NO:12) coding for b-endorphin. This b-endor- 
phln has a methionine residue at its N-temninus. This fragment in SEQ ID NO:12 is assembled from synthetic 

15 oligonucleotides SEQ ID NO:10 and SEQ ID NO:1 1 as described in Procedure VI, and is digested to completion 
with Sau3A prior to ligation to the 110 bp Xhol-Sau3A fragment The 220 bp product of this ligation is purified 
by agarose gel electrophoresis. 

The vector ptaci .3dp is subjected to partial digestion with Xhol and BamHI , and the 4886 bp linear vector 
is purified by agarose gel electrophoresis. The ligated plasmid is used to transform competent E. coli CSH26, 

20 prepared according to Procedure IV. Plasmids are isolated from transformed am picill in-resistant E. coli clones, 
and a plasmid containing the desired gene in the correct orientation is identified by restriction enzyme analysis 
and designated ptaci .3dp:metendor (Figure 2C). 

A highly pure preparation of the synthetic t>-endorphin is obtained by inoculation of L-broth containing 100 
mg/l ampidllin with the E. coli host harboring ptac1.3:metendor. A crude protein extract containing the 1 .3S:b- 

25 endorphin hybrid polypeptide is obtained by following Procedure VII. 

Highly pure 1.3S:b-endorphin polypeptide is obtained by avidin monomer affinity chromatography descri- 
bed in Procedure VIII, using elution with acetic acid to elute the purified h^rid polypeptide from the resin (pro- 
cedure VIIIA). Cleavage of b-endorphin from the 1 .3S polypeptide was accomplished by incubation in cyanogen 
bnmiide according to Part B of Procedure X, and highly pure t>-endorphin is obtained by avidin monomer affinity 

30 chromatography of the cleavage mixture by repeating Procedure XI. 

Example 4. Fusion of a polypeptide of interest directly to the carboxyl terminus of a polypeptide for attach- 
ment, with no linking amino add sequence being present in the hybrid polypeptide. 

35 In this example, a synthetic b-endorphin polypeptide is ^sed directly to the C-terminus of the 1.3S poly- 
peptide. Avidin monomer chromatography is used to obtain highly pure b-endorphin in the fomi of a hybrid fu- 
sion polypeptide. 

The construct ptac1.3t (Procedure Xll) is digested to completion with Xhol and Hindlll, and the smaller 131 
bp fragment is purified by agarose gel electropfioresis. This 131 bp fragment is subjected to partial digestion 
40 with Sau3A I, and the 1 10 bp fragment is agarose-gel purified. THe'l 10 bp fragment is ligated to the double 
stranded DMA fragment SEQ ID NO: 15. 

SEQ ID NO:1 5 encodes a synthetic b-endorphin gene with no DNA coding for a linking amino add or amino 
add sequence at its 3' terminus. 

SEQ ID N0:15 is assembled from synthetic otigonudeotides SEQ ID N0:13and SEQ ID N0:14 using Pro- 
45 cedure VI. 

Prior to ligatbn to the 110 bp fragment SEQ ID NO:15 is digested to completion with Sau3A I. The 217 
bp ligation product is purffled by agarose gel electrophoresis. 

Vector ptaci. 3dp is linearized by partial digestion with Xhol and BamHI . and the 4886 bp fragment is also 
purified using agarose gel electorphoresis. 
50 The ligated plasmid is used to transform competent E. coli CSH26. 

Recombinant plasmkis are isdated from ampicill in-resistant transformants, and a done containing the de~ 
sired gene in the correct orientation is identified by restriction enzyme analysis and designated ptaci .3:endor 
(Figure 2D). 

A highly pure preparation of the synthetic b-endorphin is obtained by inoculation of L-broth containing 100 
55 mg/l ampk^Olin with tiie E. coli host harboring ptac1.3:endor. 

A crude protein extract containing the the 1.3S:b-endorphin hybrkl polypeptide isobteined by following Pro- 
cedure VII. 

Highly pure 1.3S:b-endorphin polypeptide is obteined by avidin nx>nomer affinity chromatography as de- 
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scribed in Procedure VIII, using elution with acetic acid to elute the purified hybrid polypeptide from the resin 
(procedure VI II A). 

Example 5. Fusion of two polypeptkjes of interest to the C-terminus of a single polypeptide for attachment in 
5 the order, polypeptide for attachmentpolypeptide of interestpolypeptide of interest 

In this example, two b-endorphin polypeptides in tandem are fused to the C-terminus of one 1 .3S polypep- 
tide, with the linking amino adds asp-pro-nriet separating the first b-endorphin polypeptide from the C-terminus 
of the 1 .3S pdypeptide and another sequence of asp-pro-met separating the first b-endorphin from the second 
10 b-endorphin polypeptide. Such a fusion doubles the yield of the polypeptide of interest, at the same time pro- 
viding a means for purification of that polypeptide by avid in affinity chromatography. 

The plasmid ptacl.Sdp (Figure 1) is digested to completion with Xhol and BamHI. The resulting 118 bp 
firagnrient that encodes for the sequence beginning at amino acids 8 to the asp-pro site generated by the BamHI 
site at the 3' terminus of the 1.3S polypeptide as found in SEQ ID NO:5. This fragment is purified by agarose 
IS gel electrophoresis. 

Two tandem t>-endorphin polypeptides are generated by the creation of the double-stranded DNA frag- 
ments SEQ ID N0:18, assembled from digonudetides SEQ ID NO:16 and SEQ ID NO:17; and SEQ ID NO:21, 
assembled from SEQ ID N0:19 and SEQ ID NO:20. 

SEQ ID NO:18 and SEQ ID NO:21 were assembled from their respective oligonuleotides using the syn- 
20 thesis and strand assembly strategy described in Procedure Vl. 

SEQ ID NO:18 and SEQ ID N0:21 are digested with BamHI and ligated. 

This fragment codes for two b-endorphin pdypeptides in tandem, separated from each other by an asp- 
pro deavage sequence. The dimeric product of ligation is purified by agarose gel electrophoresis, and this frag- 
ment is l^ated to the 1 18 bp Xhol-BamH1 1 .3S partial coding sequence obtained from ptaci .3dp. This ligation 
25 product is purified by agarose gel dectrophoresis, and is ligated to a 4886 bp fragment generated by partial 
digestion of ptac1.3dp linearized by partial digestion with Xhol and BamHI. 

Plasmid DKtA is isolated from transformed E. coli HB101. Plasmids containing the correct chin>eric gene 
orientetion aro confirmed by restriction endonudease mapping and designated ptaci. 3: endonendor (Figure 
3A). 

30 A highly pure preparation of synthetic b-endorphin is obtained by inoculation of L-broth containing 1 00 mg/l 
ampicQlin with the E. coli host hartx)ring ptaci. 3:endorendor. 

A crude protein extrad containing the 1.3S: b-endorphin: b-endorphin hybrid polypeptide is obteined by fol- 
lowing Procedure VII. Highly pure 1.3S:b-endorphin: b-endorphin pdypeptide is obtained by avidin nronomer 
affinity chronnatography described in Procedure Vlll, using acetic add to elute the purified hybrid pdypeptide 
35 from the resin (procedure VillA). 

Cleavage of both tvendorphin polypeptides from the 1.3S pdypeptide in a single step is accomplished by 
incubation in fonmic add according to Part A of Procedure X, and highly pure b-endorphin is obtained by avidin 
monomer affinity chromatography of the deavage mbtture by repeating Procedure XI. 

40 Example 6. Fusion of one polypeptide of interest to the N-terminus of a pdypeptide for attachrrrent and fu- 
sion of a second polypeptide of interest to the C-terminus of the same pdypeptide for attachment 

In this example, the maltose binding protein (Guan. C. et al., Gene 67-21-30 (1987) and Maine, et al., Gene 
74:365-373 (1988)} was fused to the N-terminus of the 1.3S pdypeptide and syntiietic b-endorphin was fused 

45 to the C-terminus of the same 1 .3S polypeptide, thus creating a hybrid polypeptide consisting of two drflerent 
noncontiguous polypeptides of interest 

The (x>nstrudptec1 .3dp:endorB3 (Figure 2A) was digested with Sail and Hindi II. A 438 bp fragment created 
was purified by agarose gel electrophoresis. This fragnient encodes amino adds 19 to 123 of the 1.3S poly- 
peptide, the asp-pro-met linking amino adds, and the 31 amino ackl b-endorphin polypeptide. 

50 The vedor pMAL-c (obtained from New England Biolabs) was linearized by digestion with Sail and Hindlll. 
This vedor contains the maltose binding protein under the regulation of the tac promoter (Guan, C. et al., Gene 
67-21-30 (1987) and Maine, et at. Gene 74:365-373 (1988)). The linearized vedor and the 438 bp 1.3S-b-erh 
dorphin fragment were ligated, and the ligation mix was used to transfonm competent E. coli CHS26. Plasmid 
DNA was isolated, and plasmids containing the cxxrect chinneric gene orientation were corrfinned by restriction 

55 endonudease mapping. The resulting clone was designated ptac:ma]B:1.3:endor63 (Figure 3B). 

A highly pure preparation of the hybrid maltose binding protein-synthetic t>-endorphin polypeptide is ob- 
teined by inoculation of L-broth conteining 100 mg/l ampicOlin with the E. cdi host harboring ptec:malB: 1.3:erh 
dorB3. 
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A crude protein extract containing the hybrid polypeptide is obtained by following Procedure VII. 

Highly pure maltose binding protein:1.3S:b- endorphin hybrid polypeptide is obtained by avidin monomer 
affinity chromatography described in Procedure VIII, using biotin to elute the purrTted hybrid polypeptide from 
the resin (Procedure VIIIB). 

5 Biotin Is removed from the polypeptide suspension by dialysis against three changes of 1 00 mM ammonium 

cart)onate buffer, pH 7.2, followed by freeze-dryrng of the sample. 

Cleavage of t>-endorphin from the 1.3S polypeptide is accomplished by incubation in cyanogen bromide 
according to Part B of Procedure X, and highly pure b endorphin is obtained by avidin monomer affinity chro- 
matography of the cleavage mixture by repeating Procedure XI. 

10 The maltose-binding protein-1.3S hybrid polypeptide was recovered in highly pure fonm by repeating the 
biotin elution procedure descrit)ed in Procedure VIIIB. 



Example 7. Fusion of two polypeptkies of interest in tandem to the N-terminus of a polypeptide for attach- 
ment, with an amino add cleavage sequence separating the first pdypeptide of interest from the polypep- 
15 tide for attachment, and a second amino acid cleavage sequence separating the first polypeptide of interest 
from the second polypeptide of interest 

The maltose binding protein and t>-endorphin are fused in tandem to the amlno-terminus of the 1.3S poly- 
peptide. An amino acid cleavage site separates the maltose binding protein and t>'endorphin, and another ami- 

20 no acid cleavage site separates b-endorphin and the 1.3S polypeptide. 

The plasmid ptac1.3dp is digested with Hindi and Hindlll. The 332 bp fragment is purified by agarose gel 
electrophoresis. Linkers encoding a BamHI recognition sequence (CGGATCCG) are ligated to this fragment, 
and the fragment is digested with BamHI to generate a BamHI site at the 5' terminus of the fragment The 338 
bp BamHI -Hindlll fragment so generated is purified by agarose gel electrophoresis . 

25 The DNA fragment SEQ ID NO: 15 is digested with BamHI , and is ligated to the 338 bp BamHI-Hrndlll nrK>di- 
fied fragmentfrom ptac1.3dp. The desired 444 bp fragnnent is purified by agarose gel eledrophoresis. The vec- 
tor pMAL-c (New England Biolabs. Beverly, MA) is digested to a 6.1 kb fragment with Hindlll and BamHI . and 
the large fragment is purified by agarose gel electrophoresis. The 444 bp fragment and the 6.1 kb fragnr>ent 
are ligated, and the ligatton mbc is used to transform competent E. coli CSH26. 

30 Plasmid DNA is isolated, and plasmids containings the chimeric gene in the correct orientation are con- 
firmed by restriction endonudease mapping, and are designated ptac:ma]C:endorB3: 1 .3dp (Fig. 3C}. The cor- 
red recombinant plasmid codes for a fusion protein composed of a 42,000 MW maltose binding protein fused 
by an asp-pro-met linker to t>-endorphin join^ by an asp-pro linker to amino acids 19-123 of the 1.3S poly- 
peptide having an asp-fxo cart)oxyl temiinus. 

35 Highly pure maltose binding protein:b- endorphin: 1 .3S hybrid polypeptkle is obtained by avidin monomer 
affinity chromatography described in Procedure VIII. using bk)tin to elute the purified hybrid pdypeptide from 
the resin (Procedure VIIIB). B»tin is removed from the polypeptide suspension by dialysis against three 
changes of 100 mM ammonium carbonate buffer, pH 7.2. followed by freeze-drying of the sample. 

Cleavage of b-endorphin from the 1 .3S polypeptkle is accomplished by incubation in formic add according 

40 to Part A of Procedure X, and highly pure t>-endorphin is obtained t>y avidin monomer affinity chromatography 
of the deavage mixture by repeating Procedure XI. The maltose binding protein is recovered in highly pure form 
by elution of the maltose binding protein:1 .3S hyt)rid polypeptkle from the avklin monomer resin with btotin using 
Part B of Procedure IX. 

The hybrid polypeptkle is purified away from the biotin by dialysis against three changes of 100 mM am- 
45 monium carbonate buffer, pH 7.2, followed by freeze-drying of the sample. The sample is freeze-dried, and 
reconstituted in cyanogen bromide according to Part B of Procedure X. The maltose-binding protein is recov- 
ered in highly pure fomn by repeating the avidin monomer chromatography process detailed in Procedure XI. 



Example 8. Fuston of two polypeptkies of interest to the C-termini of two polypeptkJes for attachment within 
50 the same hybrid polypeptide. 



In this example, two noncontiguous b-endorphln polypeptides are fused to btto noncontiguous 1.3S poly- 
peptides within one hybrid polypeptide with a deavage amino add sequence between each 1 .38 polypeptide 
J and the t>-endorphin to which it is directly linked, producing the fusion hybrid polypeptide 1 .3S:asp-pro-met t>- 

55 endorphtn:asp-pro:1.3S:asp-pro-metb-endorphin. 

The vector 1 .3dp:endor63 (Figure 2A) is digested with Hindi and Hindlll and the 437 bp fragment is purified 
by agarose gel electrophoresis. Synthetk: DNA linkers encoding a BamHI recognition sequence, CGGATCCG 
(New England Biolabs, Inc. Beverty, MA) are ligated to this 437 bp fragment 
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Following ligation, this DNA is subjected to BamHI digestion to generate BamHI cohesive termini, and the 
443 bp BamHI-Hindlll fragment Is purified by agarose gel electrophoresis. The ptac 1 .3:endonendor (Figure 
3A) Is partially digested with BamHI and l-lindlll to yield a linear DNA of 5.1 kb digested at a single BamHI site 
and also at a single Hindlll site. This 5.1 kb fragment is purified by agarose gel electrophoresis, then is ligated 
5 to the 443 bp fragment. The iigatton mix is used to transform competent E. cdi HB101. 

Plasmid DNA is Isdated from ampicill in-resistant transfonned E. coll, and plasmids containing the correct 
chinneric gene orientation are confirmed by restriction endonudease mapping. 

The recombinant plasmid obtained by this procedure is designated ptad .3:endon 1 .SiendCH* (Figure 4) and 
encodes a hybrid polypeptide which permits the isolation of two mcdecules of b-endorphin for ev^ hybrid poly- 
10 peptide purified, which may double the yield of polypeptide from a single fermentation. 

A highly pure preparation of synthetic b-endorphin is obtained by inoculation of L-broth containing 100 mg/l 
ampk^Olin with the E. coli host hart)oring ptaci .3: endor 1 .3:endor. A crude protein extract containing the 1 .3S:b- 
endorphin:1.3S:b-endorphtn hybrid polypeptide is obtained by following Procedure VII. 

Highly pure 1 .3S: b-endorphin: 1 .3S :b-endorphin polypeptide is obtained by avidin monomer affinity chro- 
15 matography descrit>ed in Procedure VIII, using acetic acid to elute the purified hybrid pdypeptkJe from the resin 
(procedure VIIIA). 

Cleavage of t)oth b-endorphin polypeptides from t>oth 1.3S polypeptides in a single step is accomplished 
by incubatk>n in formic acid according to Part A of Procedure X, and highly pure t>-endorphin is obtained by 
avidin monomer affinity chromatography of the cleavage mixture by repeating R-ocedure XI. 

20 
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SEP ID NO 2 

SEQUENCE TYPE: Peptide 

SEQUENCE LENGTH: 43 

MOLECULE TYPE* Peptide 

ORIGINAL SOURCE ORGANISM: Bacterium 

SOURCE NAME: Propionibacterium shermanii 

PROPERTIES: Biotin-binding recognition sequence 

Pro Ala Pro Leu Ala Gly Thr Val Ser Lys lie Leu Val Lys Glu Gly 

1 5 10 15 

Asp Thr Val Lys Ala Gly Gin Thr Val Leu Val Leu Glu Ala Met Lys 

20 25 30 

Met Glx Thr Glu He Asn Ala Pro Thr Asp Gly 
35 40 

SEOIDNO:3 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 20 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 

PROPERTIES: Termination fragment for a BamHI deavage site 
GATCCATAAC GCCTAAGCTT 
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SEOIDNO:4 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 372 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
ORIGINAL SOURCE ORGANISM: Bacterium 
SOURCE NAME: Propionibacterium shenTtanii 
PROPERTtES: Gene coding for L3S polypeptide 



ATGAAACTGA AGGTAACAGT CAACGGCACT GCGTATGACG 40 

TTGACGTTGA CGTCGACAAG TCACACGAAA ACCCGATGGG 80 

CACCATCCrGTICGGCGGCGGCACCGGCGGCGCGCCGGCA 120 

GCGCGCGCAGCAGGTGGCGCAGGCGCCGGTAAGGCCGGAG 160 

AGGGCGAGATTCCCGCTCCGCTGGCCGGCACCGTCTCCAA 200 

GATCCTCGTG AAGGAGGGTG ACACGGTCAA GGCTGGTCAG 240 

ACCGTGCTCG TTCTCGAGGC CATGAAGATG GAGACCGAGA 280 

TCAACGCTCCCACCGACGGCAAGGTCGAGAAGGTCCITCT 320 

CAAGGAGCGTGACGCCGTGCAGGGCGGrCAGGGTClCATC 360 

AAGATCGGCr GA 372 
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SEOIDNChS 

5 SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 390 

STRANDEDNESS: Double-stranded 
10 TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 

PROPERTIES: Oligonucleotide ptacl.3dp 

15 

ATGAAACTGA AGGTAACAGT CAACGGCACT GCGTATGACG 40 
TTGACGTTG A CGTCGACAAG TCACACGAAA ACCCGATGGG 80 
^ CACCATCCTG TTCGGCGGCG GCACCGGCGG CGCGCCGGCA 120 
CCGCGCGCAG CAGGTGGCGC AGGCGCCGGT AAGGCCGGAG 160 
AGGGCG AGAT TCCCGCTCCG CTGGCCGGCA CCGTCTCCAA 200 
GATCCTCGTG AAGGAGGGTG ACACGGTCAA GGCTGGTCAG 240 
ACCGTGCTCG TTCTCGAGGC CATGAAGATG GAGACCGAGA 280 
TCAACGCTCC CACCG ACGGC AAGGTCGAGA AGGTCCTTGT 320 
CAAGGAGCGTGACGCCGTGCAGGGCGGTCAGGGTCTCATC 360 
AAG ATCGGCT GATCCATAAC GCCTAAGCTT 390 

SEOIDNa6 

SEQUENCE TYPE: Peptide 
SEQUENCE LENGTH: 31 

40 

MOLECULE TYPE: Peptide 

PROPERTIES: Modified b-endorphin polypeptide 

45 

Tyr Gly Gly Fhe Leu Thr Ser Glu Lys Ser Gin Thr Pro Leu Val Thr 
15 10 15 

^ Leu Phe Lys Asn Ala He He Lys Asn Ala Tyr Lys Lys Gly Glu 
20 25 30 
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SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 135 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 

PROPERTIES: Gene coding for modified b-endorphin polypeptide 

AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTG ACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 
CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 
TCCGAATTCG AGCTC 



SEP ID Ng 8 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Oligonucleotide RHcbel 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCIGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTC 
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SEOiDNgg 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOIXXJY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Oligonucleotide RHd)e2 

GAGCTCGAAT TCGGATCCTT ATTCGCCnT TTTGTATGCG 
mTTGATGA TAGCGTnTT GAACAGAGTA ACCAG 



SEOIDNO.IO 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

AAGCTTCTAG AGATCGGCAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTFACTC TGTTC 
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SEOIDNChll 

SEQUENCE TYPE: Nudeotid 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Ctouble-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

GAGCTCGAAT TCGGATCCTT ATTCGCCTTT TTTGTATGCG 
ilill GATGA TAGCGTrm GAACAGAGTA ACCAG 



SEOIDNai2 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 135 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 



AAGCTTCTAG AGATCGGCAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 
CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 
TCCG AATTCG AGCTC 
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SEQINNO:13 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STEIANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

AAGCrrCTAG AGATCGGCTA CGGTGGTTTC CTGACCTCCG 
AAAAATCTCA GACCCCGCTG GTTACTCTGT TCAAA 



SEOIDNO:14 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 72 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

GAGCTCGAAT TCGGATCCTT ATFCGCCnT TITGTATGCG 
TTTITGATGA TAGCGIITIT GAACAGAGTA AC 
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SEOIPNO:15 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 132 
STRANDEDNESS: Double-stranded 
TOPOLCX5Y: linear 
MOLECULE TYPE: Geaiomic DNA 



AAGCTTCTAG AGATCGGCTA CGGTGGTFrC CTGACCTCCG 
AAAAATCTCA GACCCCGCTG GTTACTCTGT TCAAAAACGC 
TATCATCAAA AACGCATACA AAAAAGGCGA ATAAGGATCC 
GAATTCGAGC TC 



SEOIDNai6 

30 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 

35 

TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
^ PROPERTEES: Synthetic oligonucleotide 

45 AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 40 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTC 75 



40 
80 
120 
132 
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SEOIDNai7 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 71 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

GAGCTCGAAT TCGGATCCTC GCCTmTTG TATGCGnTT 40 
TGATGATAGC GmTTGAAC AGAGTAACCA G 71 



SEOIDNO:18 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 130 
STRANDEDNESS: Double-Stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TrCCTGACCT 40 

CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 80 

CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAGGATCCG 120 

AATTCGAGCTC 130 
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SEOIDNO:19 

SEQUENCE TYFE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOIjOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTC 



SEQIDNa20 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

G AGCTCGAAT TCGGATCCTT ATTCGCCnT TFrCTATGCG 
TmTGATGA TAGCGmTT GAACAGAGTA ACCAG 
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SEOUEMCE LISTING 
SEO.IDNO:! 

SEQUENCE TYPE: Peptide 

SEQUENCE LENGTH: 123 

MOLECULE TYPE: Protein 

ORIGINAL SOURCE ORGANISM: Bacterium 

SOURCE NAME: Propionihacterium shermanii 

FEATURES: From 58 to 100 - biotin-binding recognition sequence 

PROPERTIES: 1.3S biotin-binding protein 



Met Lys Leu Lys Val Thr Val Asn Gly Thr Ala Tyr Asp Val Asp Val 
1 5 10 15 

Asp Val Asp Lys Ser His Glu Asn Pro Met Gly Thr lie Leu Phe Gly 

20 25 30 

Gly Gly Hur Gly Gly Ala Pro Ala Pro Arg Ala Ala Gly Gly Ala Gly 
35 40 7^ 45 



Pro Ala Pro Leu Ala Gly Thr 

^ 60 



Ala Gly Lys Ala Gly Glu Gly Glu He 

50 55 
Val Ser Lys lie Leu Val Lys Glu Gly Asp Thr Val Lys Ala Gly Gin 
65 70 75 80 

Thr Val Leu Val Leu Glu Ala Met Lys Met Gbc Thr Glu lie Asn Ala 
85 

Pro Thr Asp Gly Eys Val Glu Lys Val Leu Val Lys Glu Arg Asp Ala 



100 105 
Val Gin Gly Gly Gin Gly Leu He Lys He Gly 
115 120 



110 
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SEOIDNO:21 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 135 
STEIANDEDNESS: Double-stranded 
TOPOIjOGY: Linear 
MOLECULE TYPE: Genomic DNA 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 40 

CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 80 

CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 120 

TCCGAATTCG AGCTC 135 



Claims 

1. A recombinant hybrid polypeptide comprising a polypeptide of interest fused to an avidin- binding polypep- 
tide conteining a biotin attachment.donoaifu characterised in that the polypeptide of interest is fused to the 
C tsnninus of the avidin-binding polypeptide. 

2. A recombinant hybrid polypeptide according to da im 1 wherein biotin is attached to the avidin-binding poly- 
peptide. 

3. A recombinant hybrid polypeptide according to daim 1 or daim 2 wherein the polypeptide indudes a deav- 
age site for deaving the polypeptide of interest from the avidin-binding polypeptide. 

4. A recombinant hybrid polypeptide^^aci^ding to any one of daims 1 to 3 wherein the avidin binding poly- 
peptide is, or is part o ^^J .SS^ gj ^ggg^^^ 

5. A recombinant hybrid polypeptide according to claim 4 wherein the 1 .3S polypeptide is from Propionibac- 
terium. ■ 

6. A recombinant hybrid polypeptide according to any one of datnns 1 to 5 wherein the biotin attachment do- 
main of the avidin-binding polypeptide comprises at least one of the seq uence Pro Ala Pro Leu Ala Gly 
Thr Val Ser Lys lie Leu Val Lys Glu Gly Asp Thr Va) Lys Ala Gly Gin Thr Val Leu Val Leu Glu Ala Met Lys 
Met Glu Thr Glu He Asn Ala Pro Thr Asp Gly. ^ 

7. A recombinant hybrid polypeptide according to any one of daims 1 to 6 wherein the avidin-binding poly- 
peptide comprises a plurality of non-contiguous and/or contiguous avidin-binding polypeptides, which may 
be the same or differenL 

8. A recombinant hybrid pdypeptide according to any one of daims 1 to 6 wherein the polypeptide of interest 
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comprises a plurality of non-contiguous and/or contiguous polypeptides of interest which may be the same 
or different 

9. A recombinant hybrid pdypeptide according to any one of claims 1 to 8 wherein the polypeptide of Interest 
is an enzyme, is an antigen useful for vaccine production, or a diagnostic reagent. 

10. A recombinant hybrid polypeptide according to any one of claims 1 to 9 wherein the polypeptide of interest 
has antitumor activity or has an amino acid sequence fw recognition of antigens. 

11. A nucleic acid sequence coding for a hybrid polypeptide as defined In any one of darms 1 to 10 wherein 
the nudeic acid sequence comprises a nudeic add sequence coding for an avidin-binding polypeptide 
upstream of a nudeic add sequence coding for a polypeptide of Interest. 

12. A nudeic acid sequence according to daim 11 wherein the nudeic add sequence is a DNA sequence. 

13. A nudeic add sequence according to daim 12 wherein the DNA sequence contains In a 5' to 3' direction 
on the coding strand a gene comprising a 5' promoter region, the ONA sequence coding for the avidin- 
binding polypeptide and the DNA sequence coding for the polypeptide of interest 

14. A nudeic acid sequence according to daim 12 or daim 13 wherein the DNA sequence is, or is part of. an 
expression vector or a plasmkJ. 

15. A process for the production of a hybrid polypeptide as defined in any one of daims 1 to 10 comprising 
constructing a plasmid containing a nudeic acid sequence as defrned In any one of dainns 1 1 to 13, trans- 
forming the plasmid Into a procaryotic or eucaryotic host cell expression system, expressing tiie system, 
contacting the hyt}rid polypeptide resulting from the expression system with avidin, and harvesting the re- 
sulting avidin- bound hybrid polypeptide. 

16. A process according to daim 15 wherein the expression system Is either E. Coli or Insect cells. 

17. A process for the isdafa'on of a hybrid polypeptide as defined in any one of daims 1 to 10 comfvislng con- 
tacting the hybrid pdypeptide with avidin. 

18. A process according to any one of claims 1 5 to 1 7 wherein the avidin is monomeric avidin, tetramern avidin 
orstreptavidin. 

19. A process according to any one of daims 15 to 18 wherein the polypeptkje of interest is deaved from the 
isolated hybrid polypeptide. 

20. A process according to any one of daims 15 to 19 wherein the hybrid polypeptide is isolated using avidin 
covalentiy bound to a chemically inert solid, water and solvent insoluble substrate through a chemically 
stable non-hydrolyzable linking group, preferably the hybrid polypeptide Is Isolated using avidin monomer 
affinity chromatography. 

21. A first kit comprising a hybrid polypeptide as defined in any one of daims 1 to 10 and avidin. 

22. A second kit comprising a nudeic add sequence as defined in any one of daints 11 to 14 and avidin. 

23. A third kit comprising a nudeic add sequence which codes for an avklin-binding polypeptide containing 
a biotin attachment domain and which is fiisable to a nudeic acid sequence coding for a polypeptide of 
Interest in order to form a hybrid nucleic add sequence as defined in any one of daims 11 to 14 and avidin. 

24w A kit according to daim 23 wherein the kit comprises means to fuse the nudeic add sequence coding for 
the avkl in-binding polypeptide to the nudeic acid sequence coding for the polypeptide of Interest in order 
to form the nudeic acid sequence as defined in any one of daims 11 to 14. 

25. A kit according to any one of claims 21 to 24 wherein the kit comprises means to deave the polypeptide 
of interest from the avidin-binding pdypeptide. 
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